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phage that bind to the particular target. The phages 
recovered in this way can be amplified and the 
selection repeated. The sequence of the peptide is 
decoded by sequencing the DNA. See for example Cwirla 
et al., Pr-nr.. Natl, ^r-.^*^ ^ Sci.. USA. 87:6378-6382 
(1990); Scott et al., science . 249:386-390 (1990); and 
Devlin et al./ science, 249:404-406 (1990). 

Another approach involves large arrays of 
peptides that are synthesized in parallel and screened 
with acceptor molecules labelled with fluorescent or 
other reporter groups. The sequence of any effective 
peptide can be decoded from its address in the array, 
see for example Geysen et al. , Proc. Natl. Acad. Scj.., 
USA . 81:3998-4002 (1984); Maeji et al., J. Immunol. 
15 Met. . 146:83-90 (1992); and Fodor et al.. Science , 

251:767-775 (1991) . 

In another approach. Lam et al.. Nature , 
354:82-84 (1991) describes combinatorial libraries of 
peptides that are synthesized on resin beads such that 
20 each resin bead contains about 20 pmoles of the same 

peptide. The beads are screened with labeled acceptor 
molecules and those with bound acceptor are searched 
for by visual inspection, physically removed, and the 
peptide identified by direct sequence analysis. In 
25 principle, this method could be used with other 

chemical entities but it requires sensitive methods 
for sequence determination- 

A different method of solving the problem of 
identification in a combinatorial peptide library is 
30 used by Houghten et al., liatHEe, 354:84-86 (1991). 

For hexapeptides of the 20 natural amino acids, 400 
separate libraries are synthesized, each with the 
first two amino acids fixed and the remaining four 
positions occupied by all possible combinations. 



An 
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assay, based on competition for binding or other 
activity, is then used to find the library with an 
active peptide. Twenty new libraries are then 
synthesized and assayed to determine the effective 
5 amino acid in the third position, and the process is 

reiterated in this fashion until the active 
hexapeptide is defined. This is analogous to the 
method used in searching a dictionary; the peptide is 
decoded by construction using a series of sieves or 

10 buckets and this makes the search logarithmic. 

Large libraries of wholly or partially synthetic 
antibody combining sites, or paratopes, have been 
constructed utilizing filamentous phage display 
vectors, referred to as phagemids, yielding large 

15 libraries of monoclonal antibodies having diverse and 

novel immunospecificities. The technology uses a 
filamentous phage coat protein membrane anchor domain 
as a means for linking gene-product and gene during 
the assembly stage of filamentous phage replication, 

20 and has been used for the cloning and expression of 

antibodies from combinatorial libraries. Kang et al. , 
Proc. Natl. Acad. Sci. . USA , 88:4363-4366 (1991). 
Combinatorial libraries of antibodies have been 
produced using both the cpVIII membrane anchor (Kang 

25 et al., suora ) and the cpIII membrane anchor. Barbas 
et al., Proc, Natl- Acad. Sci.. USA . 88:7978-7982 
(1991) . 

The diversity of a filamentous phage-based 
combinatorial antibody library can be increased by 
30 shuffling of the heavy and light chain genes (Kang et 
al., Proc, Natl. Acad. Sci,. USA , 88:11120-11123, 
1991) , by altering the complementarity determining 
region 3 (CDR3) of the cloned heavy chain genes of the 
library (Barbas et al., Proc. Nat l, Acad. Sci,. USA. 
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89:4457-4461, 1992), and by introducing random 
mutations into the library by error-prone polymerase 
chain reactions (PGR) (Gram et al., Proc. Natl. Acad. 
Sci. . USA . 89:3576-3580, 1992). 

Mutagenesis of proteins has been utilized to 
alter the function, and in some cases the binding 
specificity, of a protein. Typically, the 
mutagenesis is site-directed, and therefore laborious 
depending on the systematic choice of mutation to 
induce in the protein. See, for example Corey et al., 
J. Amer. Chem. Soc . 114:1784-1790 (1992), in which 
rat trypsins were modified by site-directed 
mutagenesis . Partial randomization of selected codons 
in the thymidine kinase (TK) gene was used as a 
mutagenesis procedure to develop variant TK proteins. 
Munir et al., .t. Bjol. Chem. . 267:6584-6589 (1992). 

In one approach, Roberts et al.. Gene, 121:9-15 
(1992), described the point mutation of a protease 
inhibitor (BPTI) as a fusion protein with gene III of 
a phagemid, and demonstrated a change in binding 
specificity such that the mutant binds human 
neutrophil elastase rather than trypsin. Similarly, 
Roberts et al., Proc. Hat i - Acad. Sci.. USA. 
89:2429-2433 (1992), produced by mutagenesis, a 
library of phage displaying mutant trypsin inhibitor, 
and isolated variant enzymes with increased affinity. 

While mutagenesis has not been utilized to create 
metal binding sites in proteins, the sequence encoding 
a metal binding protein has been engineered into an 
antibody-encoding construct, the expression of which 
forms a chimeric protein as described by Das et al . , 
Proc. Natl- Arari. Sci.. USA . 89:9749-9753 (1992) and 
Sawyer et al. , Proc. Natl. Acad l Sci.. USA. 
89:9754-9758 (1992). Freeman et al.. Adv. Protein 
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Chem. . 22:257-424 (1967), has described the 
preparation of synthetic metal binding sites on 
polypeptides. In addition, nonapeptide sequences that 
were inserted into an exposed portion of the lain B 
5 protein which allowed for the selection of E. coli 
that adhered to the iron oxide, magnetite, as 
described by Brown, Pror Natl. Arad. Sci., USA , 89: 
9651-8655 (1992) . Naturally occurring bacteria that 
accumulate magnetite have also been identified as 

10 described by Matsunag, Trends in Biotechnology , 9: 91- 
95 (1991) . Alterations of existing metal loproteins 
have been reported in which amino acid residues that 
participate in the metal-protein complex were 
substituted, resulting in changes in the apoprotein's 

15 metal ion specificity and in the binding constant for 
the metal ion. However, no protein structure has been 
engineered to allow for the ability to actively select 
for a metal binding site in a protein. 

20 RT-iP,f Desf ^-ri ption Tnvention 

Methods have now been discovered using the 
phagemid vectors to produce metal binding sites 
capable of binding (interacting with) any of a large 
variety of target metals. 

25 Thus, in one embodiment, the invention describes 

a method for producing in a polypeptide a binding site 
capable of binding a preselected agent comprising 
introducing a nucleotide sequence that codes for an 
amino acid residue sequence defining the binding site 

30 into a CDR of a nucleic acid comprising an 
immunoglobulin heavy or light chain gene. 

In particular, the invention describes a method 
for producing a metal binding site in a polypeptide 
capable of binding a preselected metal. An 
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Oligonucleotide useful as a primer is used in a primer 
extension reaction for producing a metal, binding site 
in a polypeptide coded for by a hybrid immunoglobulin 
heavy or light chain gene by amplifying the CDR of a 
5 nucleic acid of the immunoglobulin gene. The 

oligonucleotide has 5' and 3' termini and comprises: 

a) a hucieotide sequence of about 5 to 50 
nucleotides in length at the 3' termini capable of 
hybridizing to a first framework region of the 

10 immunoglobulin gene; 

b) a nucleotide sequence of about 5 to 50 
nucleotides in length at the 5 • termini capable of 
hybridizing to a second framework region of the 
immunoglobulin gene; and 

3^5 c) a nucleotide sequence between said 5' and 3* 

termini according to the formula: 

[NNS]., 

where a is a whole integer from 3 to 50, N is 
independently any nucleotide, S is cytosine (C) or 

20 guanosine (G) or analogs thereof, and wherein said 5' 

and 3' terminal nucleotide sequences have a length of 
about 6 to 50 nucleotides in length, and sequences 
complementary thereto. 

The choice of framework regions depends on the 

25 CDR into which the binding site is to be inserted. 

Thus, for example, for an insertion into CDR3, the 3' 
and 5' regions of the oligonucleotides are selected as 
to be complementary in nucleotide sequence to the 
coding strand defining FR4 and FR3 that flank CDR3, 

30 respectively, where the oligonucleotide is to be 

complementary to the noncoding (anti-sense) strand of 
the template DNA. 

A preferred and exemplary CDR for insertion of a 
binding site is the CDR3 of immunoglobulin heavy 
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Chain, particularly preferred is the immunoglobulin 
heavy chain display protein present in the vectors 
PC3AP313 and P7EIII, described herein. 

Additionally contemplated for use in this 
5 invention is the complementary nucleotide sequence of 

the preferred oligonucleotide formulation and 
identified sequences. Specifically, the complementary 
oligonucleotide has 5' and 3- termini between which is 
the nucleotide formulation [SNN].. This alternative 
10 embodiment of a complementary oligonucleotide thus 
hybridizes to the coding (sense) strand of the 

template DNA. 

In another embodiment, the invention contemplates 

a method for producing in a polypeptide a metal 
15 binding site capable of binding a preselected metal 

wherein the binding site is in an immunoglobulin where 
two amino acid residues of the gene are retained. An 
oligonucleotide useful as a primer is used in a primer 
extension reaction for producing a metal binding site 
20 in a polypeptide encoded by a hybrid immunoglobulin 
heavy or light chain gene by amplifying the CDR of a 
nucleic acid of the immunoglobulin heavy or light 
chain gene. The oligonucleotide has 5« and 3« termini 
and comprises: 

25 a) a nucleotide sequence of about 5 to 50 

nucleotides in length at the 3 • termini capable of 
hybridizing to a first framework region of the 
immunoglobulin gene; 

b) a nucleotide sequence of about 5 to 50 
30 nucleotides in length at the 5 • termini capable of 

hybridizing to a second framework region of the 
immunoglobulin gene; and 

c) a nucleotide sequence between said 5' and 3' 
termini according to the formula: 
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^X- [ NNK ] a-X- [ NNK ] -X- , 
where a is a whole integer from 3 to 50, X is a 
trinucleotide encoding a native amino acid residue 
coded by the immunoglobulin gene, N is independently 
5 any nucleotide, K is guanosine (G) or thymine (T) or 
analogs thereof, wherein said internal X is one 
nucleotide triplet encoding one immunoglobulin gene 
amino acid residue and wherein said 5' and 3' terminal 
nucleotide sequences have a length of about 6 to 50 
10 nucleotides in length, and sequences complementary 
thereto. 

The methods of this invention further comprises 

the steps of: 

a) isolating the amplified CDR to form 
15 mutagenized immunoglobulin genes; 

b) expressing the isolated mutagenized 
immunoglobulin genes; and 

c) selecting species of the expressed 
mutagenized immunoglobulin genes for the ability to 

20 bind a preselected metal ion-containing molecule. 

Contemplated for use in methods for producing a 
metal binding site is a preselected metal ion- 
containing molecule is selected from the group 
consisting of magnetite, copper (II) , zinc(II), 

25 lead(II), cerium ( III ) , and iron ( III ) . 

This invention describes human monoclonal 
antibodies produced according to the methods capable 
of immunoreacting with magnetite, copper (II), 
zinc(II), lead(II), cerium ( III ) , and iron(Iir) , 

30 wherein the monoclonal antibody has the binding 

specificity of a monoclonal antibody comprising a . 
heavy chain immunoglobulin variable region amino acid 
residue sequence. 

in another embodiment, the invention contemplates 
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pharmaceutical compositions comprising at least one 
dose of an immuriotherapeutically effective amount of 
the human monoclonal antibodies containing metal 
binding sites. 

5 Also described is a DNA expression vector capable 

of expressing a phagemid immunoglobulin display 
protein comprising a polynucleotide sequence that 
codes an immunoglobulin heavy chain polypeptide that 
includes in the CDR3 portion of the heavy chain a 
10 binding site able to bind a preselected target 
molecule. 

A major advantage of the methods of the invention 
derives from the fact that new proteins having new 
metal binding specificities can be rapidly developed 
15 for use in a variety of ways. 

Brief Description of the Drawings 

In the drawings forming a portion of this 
disclosure: 

20 Figure 1 is a graph of the comparison of binding 

specificities of three purified clones that were 
selected for Cu(II) binding. Specificity was examined 
by ELISA. Antigens were (from left to right) Mg(II) 
BSA-IDA, Ni(II) BSA-IDA, Cu(II) BSA-IDA, Zn(II) 

25 BSA-IDA, and BSA-IDA (no metal) . 

Figure 2 is a schematic representation of an 
iterative strategy for selection of catalytic 
synthetic antibodies. Figure 2A depicts an antibody 
binding site formed by the dimerization of heavy and 

30 light chains. The hatched region indicates an 

antibody CDR which has been targeted for mutagenesis. 
The resulting library of antibodies are then selected 
to bind the metal-iminodiacetic acid complex shown at 
the top of panel A where X designates a vacant (HjO) 
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coordination site. The antibodies which result are 
shown in Figure 2B where the vacant X sites of the 
complex shown in panel A are substituted for His 
ligands in this example. The coordination sites 
5 formerly occupied by IDA are now vacant (X). Figure 2C 
depicts the diversification of the metal binding 
antibodies by mutagenesis of another CDR or chain 
shuffling. The resulting sublibrary is then selected 
for binding to an appropriately designed hapten to 
10 produce catalytic metalloantibodies shown in Figure 2D 
to bind both metal and hapten. 

Detailed Description of the Invention 
A. Definitions 

15 Amino Acid Residue : An amino acid formed 

upon chemical digestion (hydrolysis) of a polypeptide 
at its peptide linkages. The amino acid residues 
described herein are preferably in the "L" isomeric 
form. However, residues in the "D" isomeric form can 

20 be substituted for any L-amino acid residue, as long 
as the desired functional property is retained by the 
polypeptide. NH^ refers to the free amino group 
present at the amino terminus of a polypeptide. COOH 
refers to the free carboxy group present at the 

25 carboxy terminus of a polypeptide. In keeping with 

standard polypeptide nomenclature (described in J^. 
Biol. Chem. . 24 3:3552-59 (1969) and adopted at 37 CFR 
§1.822 (b) (2) ) , abbreviations for amino acid residues 
are shown in the following Table of Correspondence: 



30 
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TABLE OF CORRESPONDENCE 
SYMBOL AMINO ACID 



5 



10 



15 



20 



25 



1 — T.^I't'^T 


3 -Letter 




V 


Tvr 

X jr J. 


tyrosine 


G 

w 


Gly 


glycine 


F 


Phe 


phenylalanine 


M 


Met 


methionine 


TV 


Ala 


alanine 


c 




serine 


X 


T 1 ^ 


isoleucine 


L 




leucine 


T 
X 


Thr 


threonine 


V 


Val 


valine 


It 


PlTO 


proline 


rr 
K 


T.VQ 


lysine 


u 
n 


His 


histidine 


W 


Gin 


glutamine 


T? 
Cs 


Glu 


glutcunic acid 


*7 


GlX 


Glu and/ or Gin 


w 


Trp 


tryptophan 


R 


Arg 


arginine 


D 


Asp 


aspartic acid 


N 


Asn 


asparagine 


B 


Asx 


Asn and/ or Asp 


C 


Cys 


cysteine 


X 


Xaa 


Unknown or other 


It should 


be noted that all 


amino acid residue 



sequences represented herein by formulae have a 
left-to-right orientation in the conventional 
direction of amino terminus to carboxy terminus. In 
addition, the phrase "amino acid residue" is broadly 
defined to include the amino acids listed in the Table 
of Correspondence and modified and unusual amino 
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acids, such as those listed in 37 CFR 1.822(b)(4), and 
incorporated herein by reference. Furthermore, it 
should be noted that a dash at the beginning or end of 
an amino acid residue sequence indicates a peptide 
5 bond to a further sequence of one or more amino acid 

residues or a covalent bond to an amino-terminal group 
such as NHj or acetyl or to a car boxy- terminal group 
such as COOH. 

Recombinant DNA (rPNA) Molecule : A DNA molecule 

10 produced by operatively linking two DNA segments. 
Thus, a recombinant DNA molecule is a hybrid DNA 
molecule comprising at least two nucleotide sequences 
not normally found together in nature. rDNA's not 
having a common biological origin, i.e., 

15 evolutionarily different, are said to be 

"heterologous". 

Vector : A rDNA molecule capable of autonomous 
replication in a cell and to which a DNA segment, 
e.g., gene or polynucleotide, can be operatively 

20 linked so as to bring about replication of the 

attached segment. Vectors capable of directing the 
expression of genes encoding for one or more 
polypeptides are referred to herein as "expression 
vectors". Particularly important vectors allow 

25 cloning of cDNA (complementary DNA) from mRNAs 

produced using reverse transcriptase. 

Receptor : A receptor is a molecule, such as a 
protein, glycoprotein and the like, that can 
specifically (non-randomly) bind to another molecule. 

30 Antibody : The term antibody in its various 

grammatical forms is used herein to refer to 
immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules 
that contain an antibody combining site or paratope. 
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Exemplary antibody molecules are intact immunoglobulin 
molecules, substantially intact immunoglobulin 
molecules and portions of an immunoglobulin molecule, 
including those portions known in the art as Fab, 

5 Fab', F(ab')2 and F(v) . 

Antibody rmnbinina Site : An antibody combining 
site is that structural portion of an antibody 
molecule comprised of a heavy and light chain variable 
and hypervariable regions that specifically binds 

10 (immunoreacts with) an antigen. The term immunoreact 

in its various fonas means specific binding between an 
antigenic determinant-containing molecule and a 
molecule containing an antibody combining site such as 
a whole antibody molecule or a portion thereof. 

15 Monoclonal Antibody : A monoclonal antibody in 

its various grammatical forms refers to a population 
of antibody molecules that contain only one species of 
antibody combining site capable of immunoreacting with 
a particular epitope. A monoclonal antibody thus 

20 typically displays a single binding affinity for any 
epitope with which it immunoreacts. A monoclonal 
antibody may therefore contain an antibody molecule 
having a plurality of antibody combining sites, each 
immunospecif ic for a different epitope, e.g., a 

25 bispecific monoclonal antibody. Although historically 

a monoclonal antibody was produced by immortalization 
of a clonally pure immunoglobulin secreting cell line, 
a monoclonal ly pure population of antibody molecules 
can also be prepared by the methods of the present 

30 invention. 

Fusion Polypeptide ; A polypeptide comprised of 
at least two polypeptides and a linking sequence to 
operatively link the two polypeptides into one 
continuous polypeptide. The two polypeptides linked 
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in a fusion polypeptide are typically derived from two 
independent sources, and therefore a fusion 
polypeptide comprises two linked polypeptides not 
normally found linked in nature. 

Upstream : In the direction opposite to the 
direction of DNA transcription, and therefore going 
from 5 • to 3 • on the noncodingstrand , or 3 • to 5 • on 
the mRNA. 

Downstream ; Further along a DNA sequence in the 
direction of sequence transcription or read out, that 
is traveling in a 3'- to 5 '-direction along the 
noncodingstrand of the DNA or 5'- to 3 '-direction 
along the RNA transcript. 

Cistron: A sequence of nucleotides in a DNA 
molecule coding for an amino acid residue sequence and 
including upstream and downstream DNA expression 
control elements. 

Leader Polypeptide : A short length of amino acid 
sequence at the amino end of a polypeptide, which 
carries or directs the polypeptide through the inner 
membrane and so ensures its eventual secretion into 
the periplasmic space and perhaps beyond- The leader 
sequence peptide is commonly removed before the 
polypeptide becomes active. 

Reading Frame : A particular sequence of 
contiguous nucleotide triplets (codons) employed in 
translation- The reading frame depends on the 
location of the translation initiation codon* 

B. Structure of a Polypeptide Metal Binding 
site 

A binding site, and more particularly a 
metal binding site as described in more detail below, 
is any region of a protein or polypeptide that 
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participates in protein - target molecule 
interactions, and therefore the identification of the 
primary sequence of a binding site is important in 
constructing a functional binding site-containing 

5 molecule. 

The display of a binding site on a filamentous 
phage is essential to the present invention as it 
provides the ability to screen for capacity of the 
displayed binding site to bind to a preselected target 
10 molecule. Display is described in more detail but 

generally involves the preparation of a fusion protein 
containing a membrane anchor of the filamentous phage 
gene III or gene VIII protein fused to the polypeptide 
to be displayed that forms a binding site to be 
15 tested. Peptide display vectors using hexapeptides 

have been generally described before by Cwirla et al., 
Proc. Natl, ^^^-^f^ - Sci . . USA . 87:6378-6382 (1990); 
Scott et al., science . 249:386-390 (1990); and Devlin 
et al., science . 249:404-406 (1990). 

According to the present methods, the use of an 
antibody heavy or light chain as the display support 
structure on a recombinant filamentous phage, or 
phagemid, is particularly preferred. The use of an 
antibody support for a binding site provides a 
25 "scaffold" for presenting a conformationally 

constrained polypeptide to preselected binding 
conditions. In this embodiment, the CDR of an 
immunoglobulin heavy or light chain is randomly 
mutated to result in a metal binding site that is 
30 presented on a phagemid, and the phagemid can then be 
screened for binding to a preselected target molecule. 

The introduction of a binding site onto a display 
phagemid involves the use of degenerate 
oligonucleotides to mutate an immunoglobulin CDR 



20 
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region, followed by a selection step. The methods of 
this invention result in mutated antibody populations 
that have metal binding sites for binding to 
preselected metal cations (also referred to as 
5 metals) > as described herein. The degenerate 

oligonucleotides contain regions of degeneracy to 
produce a library of different binding site structures 
within a CDR of a variable region in a heavy or light 
chain. 

10 The region of a preselected phagemid display 

protein gene is first amplified by a primer extension 
reaction using a primer oligonucleotide having (1) 
regions complementary to the display protein gene, and 
(2) regions of degeneracy that introduce the 

15 variability into the resulting pool of display 

proteins for selecting metal binding specificities. 

After the primer extension reaction, which may be 
accomplished in a variety of modes including 
polymerase chain reaction (PCR) , crossover PCR, and 

20 the like, the resulting population of display protein 
genes are expressed in phagemids to form a population 
of phagemid particles having the display proteins with 
binding sites on the particle surface. The population 
of particles are then screened for the presence of 

25 particles containing a metal binding site that binds 
to a preselected metal target molecule. 

1. Metal Binding Sit e Polypeptides 

A metal binding site, also referred to 
30 as a metal cation binding site, can be any polypeptide 
sequence, typically about 3 to 50 amino acid residues 
in length, that defines a region of a protein or 
polypeptide that selectively interacts with another 
molecule or family of related molecules, referred to 
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as target molecules that contain a metal moiety. 

As is known in protein biochemistry, proteins, 
and therefore binding sites as defined herein, can 
interact with a wide diversity of molecules including 
5 other proteins, in the form of receptors, tissue 
structures, and soluble proteins, polypeptides 
including degraded proteins, polypeptide hormones and 
ligands, lipid, oligosaccharides and carbohydrates, 
nucleic acids and inorganic molecules such as metal 

10 ions. The latter is the preferred target molecule for 
use in selecting metal binding site-containing 
polypeptides of this invention. 

As shown herein and by other work in the field, 
the amino acid residue sequence of a binding site can 

15 tolerate some degree of variability and still retain a 

degree of capacity to bind the target molecule. 
Furthermore, changes in the sequence can result in 
changes in the binding specificity and in the binding 
constant between a preselected target molecule and the 

20 binding site. 

The metal cation binding sites of this invention 
are presented in the context of an immunoglobulin that 
represents a large family of molecules that include 
several types of molecules, such as IgD, IgG, IgA, IgM 

25 and IgE. An immunoglobulin is typically comprised of 
two heavy (H) and two light (L) chains with both a 
variable (V) and constant (C) region present on each 
chain. Several different regions of an immunoglobulin 
contain conserved sequences on the basis of 

30 comparative studies of known sequences of heavy or 

light chains. 

Extensive amino acid and nucleic acid sequence 
data displaying exemplary conserved sequences have' 
been compiled for immunoglobulin molecules by Rabat et 
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ai., in sequences of Proteins o f immunological 
Interest . 4th Ed., National Institutes of Health, 
Bethesda, MD (1987) . 

Present understanding of the sites of an antibody 
molecule responsible for antigen-antibody binding 
indicates that part of the antibody combining site is 
formed by heavy chain hypervariable regions, and part 
of the combining site is formed by the light chain 
hypervariable regions. See for example, Getzoff et 
al., Adv. Immunol. . 43:1-98 (1988). Six loops of 
polypeptide comprise the hypervariable regions; three 
loops from the variable region of the light chain (V^) 
and three loops from the variable region of the heavy 
chain (V„) , denoted LI, L2, L3 and HI, H2, H3 
respectively. See for example, Chothia et al. , 
Nature . 342:877-883 (1989). The hypervariable regions 
are also known as complementarity determining regions, 
or CDRs. 

Comparative studies of the known three 
dimensional structure of numerous antibody molecules 
have identified that each hypervariable region adopts 
one of a few main chain conformations or canonical 
structures. Using sequence homologies, the amino acid 
residue sequence of a heavy or light chain are aligned 
with the sequence of a known immunoglobulin heavy or 
light chain structure, respectively, the hypervariable 
region loops and specific amino acid residue positions 
within the loops, of the heavy or light chain can be 
reproducibly identified. 

Thus, for example, a LI, L2 or L3 loop structure 
(CDR LI, CDR L2 or CDR L3) can be reproducibly 
identified solely on the basis of sequence homologies 
to other immunoglobulin light chains, thereby locating 
the position of critical residue positions within the 
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loop structure. Due to the existence of variation in 
the chain length for a particular immunoglobulin light 
chain, amino acid residue position numbers are 
referred to herein by a numbering scheme that is based 
5 on alignments using homologous sequences as described 
by Rabat et al . , in Sequence s of Proteins of 
Immunological Interest . 4th Ed., U.S. Department of 
Health and Human Services, National Institute of 
Health, Bethesda, MD (1987). Specific residue ntamber 

10 reference to the amino acid residue positions for 

particular residues, therefore, will be cited herein 
as "Kabat position number" or "Kabat amino acid 
residue position number" to connote a reproducibly 
identifiable residue position on a recognized CDR loop 

15 structure. Wherever position numbers are given, they 

refer to Kabat positions. 

The V region of any immunoglobulin heavy or light 
chain molecule having the identifiable loop structures 
are useful in the present invention to produce a metal 

20 cation binding site. 

A metal cation binding site is formed by the 
reproducible folding of a V region into its 
characteristic folded structure engineered to contain 
specific contact amino acid residues. The metal 

25 binding site is formed by the geometric positioning of 
three metal ligands (contact sites) provided by the 
side chain residues of three contact amino acid 
residues to f ormi coordinating ligands for complexing a 
metal cation. Thus, the contact amino acid residues 

30 in the CDR amino acid residue sequence of an 

immunoglobulin V region defines a metal binding site. 

The structure and stereochemistry of 
protein-metal interactions in metalloproteins is 
generally well understood. See for example. Freeman 
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etal., Adv. Protein Chem. , 22:257-424 (1967); Kannan 
et al., Annals. NY Acad. Sci , . 429:49 (1984); and 
Taineretal., J> Mol. Biol> . 160:181-217 (1982). 
In the conventionally known models of metal 
5 ligand binding, the metal ligands (contact sites) for 
binding a metal cation to a metal binding site in a V 
region are positioned at three locations to provide 
three ligand contact points typically required for a 
metal cation coordination complex. Representative 

10 coordination complex geometries for the metal ligands 
can potentially be tetrahedral, square planar or 
trigonal depending upon the metal cation. However, 
tetrahedral geometries are preferred. Representative 
coordination metal complexes of the preferred 

15 tetrcihedral coordinating geometry are shown by the 
structure of the zinc (II) complex in the enzyme 
superoxide dismutase, or the copper (II) complex in 
carbonic anhydrase. See Tainer et al., J . Mol, Biol, , 
160:181-217 (1982); and Kannan et al., Annals, NY 

20 Acad. Sci. . 429:49 (1984). In these referenced 

examples, although a metal cation presents four 
potential contact sites, typically three participate 
in the metal-ligand contact, and the fourth site on 
the metal cation is free to participate in electron 

25 exchanges or sharing with solvent or solute in 

solution having access to the complexed metal. 

In the predicted metal binding contact sites 
described above, an amino acid residue that occupies 
one of the three amino acid residue positions to 

30 provide a metal ligand contact site in a metal binding 

site is referred to as a contact amino acid residue. 
Amino acid residues usually found as contact amino 
acid residues are known in the art of roetalloprotein 
biochemistry and include histidine, cysteine. 
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methionine^ aspartic acid, glutamic acid and the like 
residues known to provide a ligand for metal cations 
in aetalloproteins. 

Metal binding sites of this invention have been 
5 prepared in V regions using randomized oligonucleotide 

sequences to mutate immunoglobulin CDRs, followed by 
selection of the resulting mutated antibody 
populations for binding to preselected metal cations, 
as described herein. 

10 Thus, in one embodiment, a mutagenized CDR lias 

the ability to complex with (bind) a metal cation 
through ligands (contacts) provided by contact amino 
acid residues in the polypeptide sequence in the CDR. 
The contact amino acid residues are presented in a 

15 geometry that coordinates the complexation of a metal 

cation. 

The mutagenized CDR contains a sequence of amino 
acids that results in a site that binds metal cation 
and is therefore referred to as a metal cation binding 

20 site. A metal cation is bound because the randomly 
mutagenized CDR is capable of forming a coordination 
complex with the metal cation. 

As described further herein, a preferred and 
exemplary phage display protein is an immunoglobulin 

25 heterodimer in which the fusion to a phagemid membrane 
anchor is through an immunoglobulin heavy chain 
polypeptide. In addition, a human monoclonal antibody 
in the form of a soluble Fab fragment can readily be 
prepared from the phagemid display vector. In this 

30 regard, the resulting human monoclonal antibody (Mab) , 

whether a Mab, Fab and the like, produced having, for 
example, a metal binding site that binds to copper (II) 
(Cu2+) for example is referred to generally as Mab Cu^* 
to connote the presence of the copper-reactive metal 
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binding site. 

Preferred metal cation binding sites shown to 
bind magnetite (FejO^) , also designated Mg, in 
phagemid display proteins or in soluble forms of the 
5 antibodies thereof have been identified by the present 

methods and have the individual amino acid residue 
sequences as follows: 

Designation rSEO ID NO^ Amino Acid Residue Sequence 
Mg-1 1 SRRSRHH PRMWNGLDV 

10 Mg-2 2 GRFKRVRDRW WIFDF 

Mg-3 3 GV ARSKKMR GLWRLDV 

Mg-4 4 GIA VRSKRGR FFLFDV 

The underlined amino acid residues in antibodies 
Mg-1, Mg-3 and Mg-4 highlight the similarities between 

15 the phagemid clones and the expressed phagemid display 

proteins. The underlined amino acid residues in Mg-2 
indicates a novel basic region. The RRSRHH- and 
RSKRGR- containing sequences were each found 3 times 
with identical sequences at nucleotide level. For all 

20 the magnetite-specific binding antibodies, the first 
and penultimate amino acid residues are fixed as 
glycine (G) and aspartic acid (D) , respectively. 
Consensus features are apparent in all the 
magnetite-specific antibodies. The amino terminal 

25 portion of all the loops are rich in basic amino acids 

whereas the carboxy terminus is rich in hydrophobic 
residues. Three of four clones bear arginine (R) at 
position 9 and valine (V) at position 16. Two clones 
have an RSK triplet and one a RSR triplet. 

30 . Other preferred metal cation binding sites shown 

to bind copper(II), designated Cu2+» in phagemid 
display proteins or in soluble forms of the antibodies 
thereof have been identified by the present methods 
and have the individual amino acid residue sequences 
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Cu-2 
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CU-3 
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CU-5 
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Cu-6 


10 


CU-7 


11 


Cu-8 


12 


Cu-9 


13 


Cu-10 


14 


Cu-11 


15 
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as follows: 

n^sicmation rsF.Q ID NO) Ami no Acid Pes i due Sequence 

GRVHHHSLDV 
SWKHHAHWDV 
GSWDHRGCDG 
GHHHYGGWDH 
GHWGBHSLDT 
6HILHHQLDL 
SSQKLMLGDN 

10 Cu-8 12 SHHGHHYLNH 

GKLMMSWCRDTEGCDH 
GDTHRGHLRHHLPHDW 
GWGLWMKPFVWRAWDM 
The copper-reactive antibodies exhibit unique 
15 specificity characteristics resulting from the random 
selection procedure methods of this invention. 
Interestingly, two of these antibodies, Cu-1 and Cu-2, 
exhibit a marked preference for nickel (Ni(II)) over 
all the other metals examined as described in Example 
20 4. These unique binding preferences highlight the 
diversity which still exists in the metal binding 
sublibrary obtained by the methods as described 
herein. Analysis of the sequences of the selected 
antibodies confirms the success of the selection 

25 strategy. 

With the Cu(II) chelating sequences, based on the 
knowledge of characterized copper-containing protein 
structures, the expected ligands are His, Met, and 
Cys. A total of 106 positions within the 11 preferred 

30 cu( II) -selected CDRs were randomized with the 
oligonucleotide having the repeated degenerate 
formulation NNK in the mutagenesis procedure. 
Histidine was selected at 26 positions. Met at 6, and 
Cys at 3. The NNK mix provides each of these residues 
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at only one part in 32 in the unselected library- One 
clone contains an unpaired Cys residue which is a rare 
feature in antibody CDRs. Three clones contain a His 
at the heavy chain CDR3 position 102 which is 
5 predicted not to be on the surface of the protein. 

The two His-rich clones demonstrated a preference for 
Ni(II) while only the Trp rich clone showed 
selectivity for Cu(II) . 

Additionally preferred metal cation binding sites 
10 shown to bind zinc (II), designated Zn^^' in phagemid 

display proteins or in soluble forms of the antibodies 
thereof have been identified by the present methods 
and have the individual amino acid residue sequences 
as follows: 

15 Designation rSEO ID NO^ Amino Acid Resid ue Sequence 



Zn-1 16 SHTHALPLDF 

Zn-2 17 GRVHHHSLDV 

Zn-3 18 GQSSGGDTDD 

Zn-4 19 GQWTPRGDDF 

20 Zn-5 20 GRCCPSSCDE 

Zn-6 21 GPAKHRHRHVGQMHDS 



Further preferred metal cation binding sites 
shown to bind lead (11), designated Pb^-^' in phagemid 
display proteins or in soluble forms of the antibodies 
25 thereof have been identified by the present methods 

and have the individual amino acid residue sequences 
as follows: 

Designation (SEO ID NO) Amino Acid Residue Sequence 



Pb- 1 22 GNLRRKTSDI 

30 Pb-2 23 GESDSKREDG 

pb-3 24 GGPSLAVGDW 

pb-4 25 GPLQHTYPDY 

pb-5 26 GWKVTAEDSTEGLFDL 

Pb-6 27 GTRVWRVCQWNHEEDG 
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10 



pj,_7 28 GEWWCSFAMCPARVTOF 

pi)_8 29 GDTIFGVTMGYYAMDV 

Further preferred metal cation binding sites 
shown to bind cerium (III) hydroxide, designated Ce^*. 
in phagemid display proteins or in soluble forms of 
the antibodies thereof have been identified by the 
present methods and have the individual amino acid 
residue sequences as follows: 

npsianation rsRO ID NO^ ftmino Acjd Be.sidue Sequence 
Ce-i 30 GQVMQELGDA 

Ce-2 31 GLTEQQLQDG 

Ce-3 32 GYSYSVSPDA 

Ce-4 33 GRLGLVMTDE 

Cg_5 34 STWPGRQRLGQALSDS 

3^5 ce-6 35 GYELSWGVDQQEWWDI 

Ce-7 36 GPVRGLDQSKGVRYDN 

Cg.g 37 GLSQHIVSETQSSGDL 

Ce-9 38 GLESLKVLGVQLGGDL 

Ce-lO 39 GNMILGGPGCWSSADI 

20 Ce-11 40 GCWNVQRLWYHPPDG 

Ce-12 41 GFEVTCSWFGHWGRDS 

Further preferred metal cation binding sites 
shown to bind iron (III) chloride, designated Fe'*. in 
phagemid display proteins or in soluble forms of the 
25 antibodies thereof have been identified by the present 
methods and have the individual amino acid residue 
sequences as follows: 

Designation rsKO ID NO) atditio Acid Residue Sequence 
Fe-l 42 SASMRSAIGLWRTMDY 

j.e_2 43 GDREIFHMQWPLRVDV 

Fe-3 44 SQNPQQVCGVRCGQDK 

Fe-4 45 GNRLSSGHLLKQGQDG 

Fe-5 46 GGSDWQIGACCREDDL 

Pe_5 47 6MVSMMGQSRPTQCDC 



30 



Fe-7 48 GVIKWIRRWVRTARDV 

Fe-8 49 GWFWRLLPTPRAPSDV 

As described above for Cu-reactive antibodies, 
those reactive antibodies selected for binding to 
Zn(II), Pb(II), Ce(III), and Fe(III) exhibit unique 
sequence characteristics. Zn (II) -selected sequences 
show the His rich character seen for Cu(II) and one 
sequence which is identical to that selected with 
Cu(II). Only one clone is rich in carboxy lates . 
Selection for His, Cys, Asp, and Glu is expected based 
on studies of natural proteins as described by Vallee 
et al., Biochem, . 29:5647-5659 (1990). The 
coordinating ligands from sequences selected for 
binding to Pb(II) , Ce(III) , and Fe(III) are less 
obvious. A number of features are apparent. Pb(Il) 
sequences are rich in Asp and Glu (10% of randomized 
residues). For comparison^ Ce(III), Fe(III), and 
Cu(II) selected sequences contained these residues at 
7, 4, and 5% respectively. The oxygen-containing 
ligands Ser, Thr, and Tyr constitute 18% of Pb(II) and 
Ce(III) sequences as compared to 11 and 9% for Fe(III) 
and Cu(II). The Ser and Thr amino acid residues are 
over-represented at the synthesis level and are 
present at 3 and 2 parts per 32. Fe (III) -reactive 
sequences are enriched to the 10% level for the sulfur 
containing residues Cys and Met. 

Thus, for all the metal binding antibodies of 
this invention described above, within the pool of 20 
commonly occurring amino acids, almost half have been 
observed to participate in metal ligation within 
proteins (Asp, Cys, Glu, His, Met, Ser, Thr, Tyr, and 
Trp) . Main chain carbonyl oxygens and amide nitrogens 
can also be utilized to coordinate metals. Most 
protein engineering efforts utilize design strategies 
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based on analogy as described by Tainer et al., Curr^ 
n pSn, Biotech. . 2:582-591 (1991). While such a 
strategy may be sufficient for the transfer of known 
metal binding motifs into alternative proteins, it 
5 limits the ability to explore and exploit novel 

reactivities for most of the periodic table. 

The ability to selectively sort proteins in vitro 
for metal ligation using the methods of this invention 
provides the distinct advantage of overcoming the 
10 limitations in scope resulting from natural 

phylogenies. Furthermore, the randomization and 
selection methods used in obtaining the metal-binding 
antibodies of the instant invention generate a variety 
of ligating groups which provide a multiplicity of 
15 coordination numbers and geometries resulting in the 

alteration of the reactivity of the bound metal. 

A further embodiment of this invention are 
antibodies containing metal binding sites selected 
against compounds containing the elements listed in 
20 the Periodic Table of the Elements under columns 3-14 
as described in rbPinical ? nH Fnaineerina News, 63:27 
(1985) and as described in Section C5 below. 



25 



1. fieneral Ufi^-s For M etal Binding Site- 

rnn^-ainina pnlvpeptirt pq and Methods of 
Preparation . 

The present inventions provide a 
variety of utilities, as discussed further herein. In 
particular, a metal binding site polypeptide, and more 
30 particularly a monoclonal antibody of this invention, 
provides a unique reagent having both therapeutic and 
diagnostic utilities. 

Insofar as a polypeptide or antibody of this 
invention binds to a preselected metal ibn-containing 
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molecule, it may be used as a reagent to bind to the 
metal ion as a metal ion chelator. In this way the 
reagent is useful for the isolation of metal ions and 
metal ion-containing molecules, as in purification 
5 procedures where the presence of metal ion is 
undesirable . 

Alternatively, the presence of a metal ion 
binding site may be used as a means for detection the 
presence of the polypeptide by detecting the presence 
10 of a metal ion associated with the polypeptide to 

which it is bound. Exemplary methods include magnetic 
resonance imaging, metal isotope labelling and other 
detection methods sensitive for a metal ion-containing 
molecule. 

15 The magnetic antibodies described herein are 

particularly useful in magnetic imaging and magnetic 

separation schemes. 

In another embodiment, the invention describes 

the iterative method for producing useful metal 
20 binding polypeptides, particularly catalytic 

antibodies which use the metal ion as a cofactor in 

catalysis. 

C. Methods for Produ f^^na Metal Binding Sites 
25 The present invention relates generally to 

methods for producing polypeptide-def ined binding 
sites. The method involves the preparation of 
libraries of different binding sites on a phage 
display protein using degenerate oligonucleotides and 
30 primer extension reactions to incorporate the 

degeneracies into the binding site region of the 
display protein. Thereafter, the display protein is 
screened for the ability to bind to a metal ion. 



wo 94/18220 PCT/US94/01238 

- 30 - 



1. Phaaemid Display Proteins 

The display of the metal binding sites 
of this invention on a phagemid can be accomplished on 
any of the surface proteins of the filamentous phage 
5 particle, although particularly preferred are display 
proteins comprising gene III or gene VIII protein, as 
described herein. The use of gene III or gene VIII 
protein as a display protein on filamentous phage has 
been extensively described elsewhere herein. 

10 Particularly preferred display proteins are 

fusions involving the use of the phage particle 
membrane anchor derived from gene III or gene VIII 
fused to an immunoglobulin heavy or light chain as 
described herein. In this embodiment, the binding 

15 site is displayed in a CDR of the immunoglobulin heavy 

or light chain, which in turn is a fusion to the 
membrane anchor domain of the phage's gene III or gene 
VIII protein. 

When using an immunoglobulin heavy or light chain 

20 as the display protein, it is preferred to position 
the binding site within one or more of the 
complementarity determining regions, CDRl, CDR2 or 
CDR3. Using the Kabat immxmoglobulin amino acid 
residue sequence position numbering system, the light 

25 chain CDR's are as follows: CDRl (residues 23-35) , 

CDR2 (residues 49-57), and CDR3 (residues 88-98); and 
the heavy chain CDR's are as follows: CDRl (residues 
30-36), CDR2 (residues 49-66), and CDR3 (residues 
94-103). See, Kabat et al., "Sequences of Proteins of 

30 Immunological Interest", 5th ed., NIH, (1991). 

When mutagenizing a CDR of an immunoglobulin 
fusion display protein, some, most or all of the CDR 
can be removed and substituted by the newly 
incorporated sequences that create the metal binding 
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site* CDRs are very accommodating to variably sized 
inserts without disrupting the ability of the 
immunoglobulin to assemble and display the newly 
randomized and selected amino acid residue sequence. 
5 In one embodiment, a phage display protein can be 

engineered to contain multiple binding sites. For 
example, using the heavy chain immunoglobulin as 
exemplary, binding sites can be created separately by 
the methods of this invention into one or more of the 

10 CDRs, designated CDRl, CDR2 and CDR3 . Additionally, 

one can introduce binding sites into a heavy chain CDR 
and a light chain CDR, into multiple heavy and light 
chain CDRs, and the like combinations. 

In another embodiment, the phage display protein 

15 is engineered to include stabilization features in 

addition to the stabilization provided by the native 
structure of the display protein. To that end, 
cysteine residues can be coded for by the 
oligonucleotide, such that disulfide bridges can be 

20 formed. The placement of the cysteine residues can be 

varied, such that a loop structure of from about 5 to 
20 amino acid residues is formed. 

2. Oligonucleotides 

25 The preparation of a metal binding site 

according to the present invention involves the use of 
synthetic oligonucleotides designed to introduce a 
putative binding site into a display protein. 
Furthermore, the oligonucleotide strategy described 

30 herein has particular advantages in creating in a 

single reaction an extremely large population of 
different randomized binding sites by the use of 
degenerate oligonucleotides. 

The present invention describes methods for 
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producing binding sites that bind metal ions. The 
methods generally follow the methods described herein 
with a few exceptions. 

The mutagenizing oligonucleotide has a structure 
5 that is generally degenerate throughout the central 

portion as described in co-pending application Serial 
No. 07/826,623 filed January 27, 1992, having the 
title "Heterodimeric Receptor Libraries Using 
Phagemids", the teachings of which are hereby 

10 incorporated by reference. 

The mutagenizing oligonucleotide randomizes the 
amino acid residue sequence of the immunoglobulin CDR, 
and the subsequent screening of the expressed phagemid 
display protein for metal ion binding is conducted as 

15 described herein and further in the Examples. 

Several oligonucleotide designs were utilized to 
form a metal binding site of varying lengths 
comprising a CDR. In one design, a series of 5, 10 or 
16 consecutive amino acid residues were randomized by 

20 a degenerate oligonucleotide. In another design, 

select amino acid residues were maintained while the 
remainder of residues in the site were randomized. 
Thus, one oligonucleotide introduced a 5 amino acid 
residue sequence in which the fourth amino acid 

25 residue was kept constant as aspartic acid (D) . 

Another oligonucleotide design introduced a 10 amino 
acid residue sequence in which the first and ninth 
amino acid residues were kept constant as glycine (G) 
and aspartic acid (D) , respectively. A third 

30 oligonucleotide design introduced a 16 amino acid 

residue sequence in which the first and fifteenth 
amino acid residues were kept constant as glycine (G) 
and aspartic acid (D) , respectively. 

The general structure of an oligonucleotide for 
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use in the present methods has the general formula 
ANB, where A and B define regions of homology to 
regions of the display protein gene which flank the 
site in which a binding site is to be inserted and N 
5 defines region of degeneracy in which variable amino 
acid residues are introduced. 

The number of nucleotides for each region (A, B, 
or N) can vary widely, but must be in triplets so as 
to preserve the reading frame of the display protein. 

10 Typically, A and B are of sufficient length to confer 
hybridization specificity with the template during the 
primer extension reaction. Thus, A and B are 
typically each at least 6 nucleotides, and preferably 
each at least 9 nucleotides in length, although they 

15 can be 12, 15, 18, 21 and up to about 24 nucleotides 

in length. The N's are typically of a widely variable 
length coding typically from 3 to 50 amino acid 
residues, preferably 3 to 24 amino acid residues in 
length. 

20 Where the display protein is an immunoglobulin, 

the homologies are directed to the immunoglobulin 
framework regions (FR) that flank the CDR into which 
the binding site is to be inserted. 

Thus, in one embodiment, the invention 

25 contemplates an oligonucleotide useful as a primer for 

producing a metal binding site in a polypeptide coded 
for by a hybrid immunoglobulin heavy or light chain 
gene. The oligonucleotide has 5' and 3' teriuini and 
comprises: 

30 i) a nucleotide sequence of about 5 to 50 

nucleotides in length at the 3 • termini capable of 
hybridizing to a first framework region of the 
immunoglobulin gene; 

ii) a nucleotide sequence of about 5 to 50 
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nucleotides in length at the 5' termini capable of 
hybridizing to a second framework region, of the 
immunoglobulin gene? and 

iii) a nucleotide sequence between said 5« and 3' 
5 termini according to the formula: 

[NNS]., 

where a is a whole integer from 3 to 50, N is 
independently any nucleotide, S is cytosine (C) or 
guanosine (G) or analogs thereof, and wherein said 5' 

10 and 3« terminal nucleotide sequences have a length of 
about 6 to 50 nucleotides in length, and sequences 
complementary thereto. 

The choice of framework regions depends on the 
CDR into which the binding site is to be inserted. 

15 Thus, for example, for an insertion into CDR3, the 3' 

and 5* regions of the oligonucleotides are selected as 
to be complementary in nucleotide sequence to the 
coding strand defining FR4 and FR3 that flank CDR3, 
respectively, where the oligonucleotide is to be 

20 complementary to the noncoding (anti-sense) strand of 
the template DNA. 

A preferred and exemplary CDR for insertion of a 
binding site is the CDR3 of immunoglobulin heavy 
chain. Particularly preferred is the immunoglobulin 

25 heavy chain display protein present in the vectors 

pC3AP313 and p7EIII, described herein. 

Oligonucleotides used in the present methods that 
are particularly preferred for producing a metal 
binding site of 5 and 10 amino acid residues in the 

30 heavy chain of CDR3 have the following respective 

nucleotide sequence formulas: HCDR5 

5 • -GTGTATTATTGTGCGAGA (NNS) 5TGGGGCCAAGGGACCACG-3 • (SEQ 
ID NO 50) AND HCDRIO 

5 " -GTGTATTATTGTGCGAGA (NNS) ioTGGGGCCAAGGGACCACG-3 • (SEQ 
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ID NO 51) . 

Additionally contemplated for use in this 
invention is the complementary nucleotide sequence of 
the preferred oligonucleotide formulation and 
5 identified sequences. Specifically, the complementary 

oligonucleotide has 5' and 3' termini between which is 
the nucleotide formulation [SNN]^- This alternative 
embodiment of a complementary oligonucleotide thus 
hybridizes to the coding (sense) strand of the 

10 template DNA. 

In another embodiment, the invention contemplates 
an oligonucleotide useful as a primer for producing a 
metal binding site in a polypeptide coded for by a 
hybrid immunoglobulin heavy or light chain gene, where 

15 two amino acid residues of the gene are retained. The 

oligonucleotide has 5' and 3' termini and comprises: 

i) a nucleotide sequence of about 5 to 50 
nucleotides in length at the 3* termini capable of 
hybridizing to a first framework region of the 

20 immunoglobulin gene; 

ii) a nucleotide sequence of about 5 to 50 
nucleotides in length at the 5* termini capable of 
hybridizing to a second framework region of the 
immunoglobulin gene; and 

25 iii) a nucleotide sequence between said 5» and 3' 

termini according to the fonaula: 

-X- [ NNK ] a-X- [ NNK ] -X- , 
where a is a whole integer from 3 to 50, X is a 
trinucleotide encoding a native amino acid residue 

30 coded by the immunoglobulin gene, N is independently 
any nucleotide, K is guanosine (G) or thymine (T) or 
analogs thereof, wherein said internal X is one 
nucleotide triplet encoding one immunoglobulin gene 
amino acid residue and wherein said 5' and 3' terminal 
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nucleotide sequences have a length of about 6 to 50 
nucleotides in length, and sequences complementary 
thereto. 

A preferred and exemplary CDR for insertion of a 
5 binding site is the CDR3 of immunoglobulin heavy 

chain. Particularly preferred is the immunoglobulin 
heavy chain display protein present in the vectors 
PC3AP313 and p7EIII, described herein. 

Oligonucleotides used in the present methods that 
10 are particularly preferred for producing a metal 

binding site of 10 and 16 amino acid residues in the 
heavy chain of CDR3 have the first and next to last 
amino acid residues in the mutagenized CDR kept 
constant as glycine (G) and aspartic acid (D) 
15 residues, respectively. These preferred 

oligonucleotides have the following respective 
nucleotide sequence formulas: HCDRDIO 

5 ' -GCCGTGTATTACTGTGCGAGAGGTIMiaTGACNNKTGGGGCCAAGGGACC 

ACGGTC-3' (SEQ ID NO 52) and HCDRD16 

20 5 • -CCCCT?T^'^'^^^<"'^^'T'r.rn&f; AGGT f NNK^ „GACNNKTGGGGCCAAGGGAC 

CACGGTC-3' (SEQ ID NO 53). The underlined nucleotides 
in the above oligonucleotide formulations indicate the 
complete 10 or 16 amino acid residue CDR. 

Another preferred oligonucleotide used in the 

25 present methods for producing a metal binding site of 
5 amino acid residues in the heavy chain of CDR3 has 
the next to last amino acid residue in the mutagenized 
CDR kept constant as an aspartic acid (D) residue. 
This oligonucleotide has the following nucleotide 

30 sequence formula: HCDRD5 

5 • -CGGGTGTATTACTGTGCGAGA rWNK^ .GACNNK TGGGGCCAAGGGACCACG 
GTC-3' (SEQ ID NO 54). The underlined nucleotides in 
the above oligonucleotide formulation indicate the 
complete 5 amino acid residue CDR. 



Additionally contemplated for use in this 
invention is the compleaentary nucleotide sequence of 
the preferred oligonucleotide formulation and 
identified sequences. Specifically, the complementary 
oligonucleotide has 5* and 3' termini between which is 
the nucleotide formulation -X-[MNN]-X-[MNN]„-X-, 
wherein all elements of the formulation are as 
presented above. This alternative embodiment of a 
complementary oligonucleotide thus hybridizes to the 
coding (sense) strand of the template DNA. 

Oligonucleotides for use in the present invention 
can be synthesized by a variety of chemistries as is 
well known. An excellent review is "Oligonucleotide 
Synthesis: A Practical Approach", ed. M.J. Gait, JRL 
Press, New York, NY (1990). Suitable synthetic 
methods include, for example, the phosphotriester or 
phosphodiester methods see Narang et al., Meth. 
Enzvmol. . 68:90, (1979); U.S. Patent No. 4,356,270; 
and Brown et al., Meth. Enzvmol. , 68:109, (1979). 
Purification of synthesized oligonucleotides. for use 
in primer extension and PGR reactions is well known. 
See, example Ausubel et al., "Current Protocols in 
Molecular Biology", John Wiley & Sons, New York, 
(1987). Oligonucleotides for use in the present 
invention are commercially synthesized by Operon 
Technologies, Alameda, CA. 

3 . Primer Extension Reactions 

The term "polynucleotide" as used 
herein in reference to primers, probes and nucleic 
acid fragments or segments to be synthesized by primer 
extension is defined as a molecule comprised of two or 
more deoxyribonucleotides or ribonucleotides, 
preferably more than three. Its exact size will 
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depend on many factors, which in turn depends on the 
ultimate conditions of use. 

The term "primer" as used herein refers to a 
polynucleotide whether purified from a nucleic acid 
5 restriction digestion reaction or produced 

synthetically, which is capable of acting as a point 
of initiation of nucleic acid synthesis when placed 
under conditions in which synthesis of a primer 
extension product which is complementary to a nucleic 

10 acid strand is induced, i.e., in the presence of 

nucleotides and an agent for polymerization such as 
DNA polymerase, reverse transcriptase and the like, 
and at a suitable temperature and pH. The primer is 
preferably single stranded for maximum efficiency, but 

15 may alternatively be in double stranded form. If 
double stranded, the primer is first treated to 
separate it from its complementary strand before being 
used to prepare extension products. Preferably, the 
primer is a polydeoxyribonucleotide. The primer must 

20 be sufficiently long to prime the synthesis of 

extension products in the presence of the agents for 
polymerization. The exact lengths of the primers will 
depend on many factors, including temperature and the 
source of primer. For example, depending on the 

25 complexity of the target sequence, a polynucleotide 

primer typically contains 15 to 25 or more 
nucleotides, although it can contain fewer 
nucleotides. Short primer molecules generally require 
cooler temperatures to form sufficiently stable hybrid 

30 complexes with template. 

The primers used herein are selected to be 
"substantially" complementary to the different strands 
of each specific sequence to be synthesized or 
amplified. This means that the primer must be 



sufficiently complementary to non-randomly hybridize 
with its respective template strand. Therefore, the 
primer sequence may or may not reflect the exact 
sequence of the template . For example , a 
non-complementary nucleotide fragment can be attached 
to the 5' end of the primer, with the remainder of the 
primer sequence being substantially complementary to 
the strand. Such non-complementary fragments 
typically code for an endonuclease restriction site. 
Alternatively, non-complementary bases or longer 
sequences can be interspersed into the primer, 
provided the primer sequence has sufficient 
complementarily with the sequence of the strand to be 
synthesized or amplified to non-randomly hybridize 
therewith and thereby form an extension product under 
polynucleotide synthesizing conditions. 

Primers of the present invention may also contain 
a DNA-dependent RNA polymerase promoter sequence or 
its complement. See for example, Krieg et al., Nucl. 
Acids Res. . 12:7057-70 (1984); Studier et al. , J. Mol. 
Biol. . 189:113-130 (1986); ant^ Molecular Cloning: A 
Laboratory Manual. Secon d Edition. Sambrook et al., 
eds. , Cold Spring Harbor, NY (1989). 

When a primer containing a DNA-dependent RNA 
polymerase promoter is used the primer is hybridized 
to the polynucleotide strand to be amplified and the 
second polynucleotide strand of the DNA-dependent RNA 
polymerase promoter is completed using an inducing 
agent such as E. coli DNA polymerase I, or the Klenow 
fragment of E. coli DNA polymerase. The starting 
polynucleotide is amplified by alternating between the 
production of an RNA polynucleotide and DNA 
polynucleotide. 

Primers may also contain a template sequence or 
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replication initiation site for a RNA-directed RHA 
polymerase. Typical RNA-directed RNA polymerase 
include the QB replicase described by Lizardi et al., 
T^^o^^nhnoloav . 6:1197-1202 (1988). RNA-directed 
5 polymerases produce large numbers of RNA strands from 
a small number of template RNA strands that contain a 
template sequence or replication initiation site. 
These polymerases typically give a one million-fold 
amplification of the template strand as has been 
10 described by Kramer et al., J. Wol . Biol.. 89:719-736 
(1974) . 

The choice of a primer's nucleotide sequence 
depends on factors such as the distance on the nucleic 
acid from the region of the display protein gene into 
15 which a binding site is being introduced, its 

hybridization site on the nucleic acid relative to any 
second primer to be used, and the like. 

The PCR reaction is performed using any suitable 
method. Generally it occurs in a buffered aqueous 
20 solution, i.e., a PCR buffer, preferably at a pH of 
7-9, most preferably about 8. Preferably, a molar 
excess of the primer is admixed to the buffer 
containing the template strand. A large molar excess 
of about 10*: 1 of primer to template is preferred to 
25 improve the efficiency of the process. 

The PCR buffer also contains the 
deoxyribonucleotide triphosphates dATP, dCTP, dGTP, 
and dTTP and a polymerase, typically thermostable, all 
in adequate amounts for primer extension 
30 (polynucleotide synthesis) reaction. The resulting 

solution (PCR admixture) is heated to about 90 degrees 
Celsius (90C) - lOOC for about 1 to 10 minutes, 
preferably from 1 to 4 minutes. After this heating 
period the solution is allowed to cool to 54C, which 



wo 94/18220 



41 



PCT/US94/01238 



is preferable for primer hybridization. The synthesis 
reaction may occur at from room temperature up to a 
temperature above which the polymerase (inducing 
agent) no longer functions efficiently. Thus, for 
5 example, if DNA polymerase is used as inducing agent, 
the teH^erature is generally no greater than about 
40C. An exemplary PGR buffer comprises the following: 
50 mM KCl; 10 mM Tris-HCl; pH 8.3; 1.5 mM MgC^; 
0.001% (wt/vol) gelatin, 200 micromolar (uM) dATP; 200 

10 uM dTTP; 200 \M dCTP; 200 xiM dGTP; and 2.5 units 

Thermus acmaticus DNA polymerase I (U.S. Patent No. 
4,889,818) per 100 microliters of buffer- Exemplary 
PCR amplifications are performed using the buffer 
system as described in Example 1. 

-L5 The inducing agent may be any compound or system 

which will function to accomplish the synthesis of 
primer extension products, including enzymes. 
Suitable enzymes for this purpose include, for 
exaanple, E. • coli DNA polymerase I, Klenow fragment of 

20 E. coli DNA polymerase I, T4 DNA polymerase, other 

available DNA polymerases, reverse transcriptase, and 
other enzymes, including heat-stable enzymes, which 
will facilitate combination of -the nucleotides in the 
proper manner to form the primer extension products 

25 which are complementary to each nucleic acid strand. 

Generally, the synthesis will be initiated at the 3' 
end of each primer and proceed in the 5« direction 
along the template strand, until synthesis terminates, 
producing molecules of different lengths. There may 

30 be inducing agents, however, which initiate synthesis 
at the 5' end and proceed in the above direction, 
using the same process as described above. 

The inducing agent also may be a compound or 
system which will function to accomplish the synthesis 
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Of RNA primer extension products, including enzymes. 
In preferred embodiments, the inducing agent may be a 
DNA-dependent RNA polymerase such as T7 RNA 
polymerase, T3 RNA polymerase or SP6 RNA polymerase. 
5 These polymerases produce a compl«nentary RNA 

polynucleotide. The high turn over rate of the RNA 
polymerase amplifies the starting polynucleotide as 
has been described by Chamberlin et al.. The Enzymes , 
ed. P. Boyer, PP. 87-108, Academic Press, New York 

10 (1982). Another advantage of T7 RNA polymerase is 

that mutations can be introduced into the 
polynucleotide synthesis by replacing a portion of 
cDNA with one or more mutagenic oligodeoxynucleotides 
(polynucleotides) and transcribing the 

15 partially-mismatched template directly as has been 

previously described by Joyce et al., Nuc. Acids Res. , 
17:711-722 (1989). Amplification systems based on 
transcription have been described by Gingeras et al., 
in PGR Protocols. A Guide t o Methods and Applications. 

20 pp 245-252, Academic Press, Inc., San Diego, CA 
(1990) . 

If the inducing agent is a DNA-dependent RNA 
polymerase and therefore incorporates ribonucleotide 
triphosphates, sufficient amounts of ATP, CTP, GTP and 
25 UTP are admixed to the primer extension reaction 

admixture and the resulting solution is treated as 
described above. 

The newly synthesized strand and its 
complementary nucleic acid strand form a 
30 double-stranded molecule which can be used in the 

succeeding steps of the process, as is known for PCR. 

PGR is typically carried out by thermocycling 
i.e., repeatedly increasing and decreasing the 
temperature of a PCR reaction admixture within a 
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temperature range whose lower limit is about lOC to 
about 40C and whose upper limit is about. 90C to about 
lOOC. The increasing and decreasing can be 
continuous, but is preferably phasic with time periods 
5 of relative temperature stability at each of 

temperatures favoring polynucleotide synthesis, 
denaturation and hybridization. 

PGR amplification methods are described in detail 
in U.S. Patent Nos. 4,683,192, 4,683,202, 4,800,159, 

10 and 4,965,188, and at least in several texts including 
"PGR Technology: Principles and Applications for DNA 
Amplification", H. Erlich, ed. , Stockton Press, New 
York (1989); and "PGR Protocols: A Guide to Methods 
and Applications", Innis et al. , eds.. Academic Press, 

15 San Diego, Galifornia (1990); the teachings of which 
are hereby incorporated by reference. 

Preferred PGR reactions using the 
oligonucleotides and methods of this invention are 
described in the Examples. 

20 

4. Phage Display Vectors 

Random mutagenesis of CDRs in a V 
region and screening methods such as is described by 
Barbas et al, Proc. Nat l. Acad. Sci.. USA. 
25 89:4457-4461, (1992) can be used for preparing 

antibodies that contain metal binding specificities. 

The methods of the present invention for 
preparing metal binding sites involve the use of phage 
display vectors for their particular advantage of 
30 providing a means to screen a very large population of 
expressed display proteins and thereby locate one or 
more specific clones that code for a desired binding 
reactivity. 

The use of phage display vectors derives from the 



wo 94/18220 



PCT/US94/01238 



- 44 - 

previously described use of combinatorial libraries of 
antibody molecules based on phagemids. The 
combinatorial library production and manipulation 
methods have been extensively described in the 
5 literature, and will not be reviewed in detail herein, 
except for those feature required to make and use 
unique embodiments of the present invention. However, 
the methods generally involve the use of a filamentous 
phage (phagemid) surface expression vector system for 
10 cloning and expressing antibody species of the 
library. 

Various phagemid cloning systems for producing 
combinatorial libraries have been described by others. 
See for example the preparation of combinatorial 
15 antibody libraries on phagemids as described by Kang 

et al., Proc. Nat3. Acad. Sc i.. USA. 88:4363-4366 

(1991) ; Barbas et al. , Proc. Nat l. Acad. Sci.. USA, 
88:7978-7982 (1991); Zebedee et al., Proc. Natl. Acad. 
Sci. . USA . 89:3175-3179 (1992); Kang et al. , Proc. 

20 Natl. Acad. Sci.. USA . 88:11120-11123 (1991); Barbas 

et al., Proc. Natl. Acad. S ci.. USA. 89:4457-4461 

(1992) ; and Gram et al., Proc. Natl. Acad. Sci.. USA, 
89:3576-3580 (1992), the disclosures of which are 
hereby incorporated by reference. 

25 

a. Phage Displav Vector Str ucture 

A preferred phagemid vector of the 
present invention is a recombinant DNA (rDNA) molecule 
containing a nucleotide sequence that codes for and is 
30 capable of expressing a fusion polypeptide containing, 

in the direction of amino- to carboxy-terminus , (1) a 
prokaryotic secretion signal domain, (2) a 
heterologous polypeptide defining an immunoglobulin 
heavy or light chain variable region, and (3) a 
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filamentous phage membrane anchor domain. The vector 
includes DNA expression control sequences for 
expressing the fusion polypeptide, preferably 
prokaryotic control sequences. 
5 The filamentous phage membrane anchor is 

preferably a domain of the cpIII or cpvill coat 
protein capable of associating with the matrix of a 
filamentous phage particle, thereby incorporating the 
fusion polypeptide onto the phage surface. 

10 Preferred membrane anchors for the vector are 

obtainable from filamentous phage M13, fl, fd, and 
equivalent filamentous phage. Preferred membrane 
anchor domains are found in the coat proteins encoded 
by gene III and gene VIII. The membrane anchor domain 

15 of a filamentous phage coat protein is a portion of 
the carboxy terminal region of the coat protein and 
includes a region of hydrophobic amino acid residues 
for spanning a lipid bilayer membrane, and a region of 
charged amino acid residues normally found at the 

20 cytoplasmic face of the membrane and extending away 
from the membrane. 

In the phage fl, gene VIII coat protein's 
membrane spanning region comprises residue Trp-26 
through Lys-40, and the cytoplasmic region comprises 

25 the carboxy-terminal 11 residues from 41 to 52 (Ohkawa 
et al., .T. Biol. Chem. . 256:9951-9958, 1981). An 
exemplary membrane anchor would consist of residues 26 
to 40 of cpVIII. Thus, the amino acid residue 
sequence of a preferred membrane anchor domain is 

30 derived from the M13 filamentous phage gene VIII coat 
protein (also designated cpVIII or CP 8). Gene VIII 
coat protein is present on a mature filamentous phage 
over the majority of the phage particle with typically 
about 2500 to 3000 copies of the coat protein. 
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In addition, the amino acid residue sequence of 
another preferred membrane anchor domain is derived 
from the M13 filamentous phage gene III coat protein 
(also designated cpIII) . Gene III coat protein is 
5 present on a mature filamentous phage at one end of 

the phage particle with typically about 4 to 6 copies 
of the coat protein. 

For detailed descriptions of the structure of 
filamentous phage particles, their coat proteins and 

10 particle assembly, see the reviews by Rached et al., 
^^nT-obioi. Rev. . 50:401-427 (1986); and Model et al., 
in "The Bacteriophages: Vol. 2", R. Calendar, ed. 
Plenvim Publishing Co. , pp. 375-456 (1988). 

The secretion signal is a leader peptide domain 

15 of a protein that targets the protein to the 

periplasmic membrane of gram negative bacteria. A 
preferred secretion signal is a pelB secretion signal. 
The predicted amino acid residue sequences of the 
secretion signal domain from two pelB gene product 

20 variants from F.rwinia carotova are described in Lei et 
al-. Nature . 331:543-546 (1988). 

The leader sequence of the pelB protein has 
previously been used as a secretion signal for fusion 
proteins (Better et al., science . 240:1041-1043 

25 (1988); Sastry et al., Proc. Na tl. Acad. Sci. . USA, 

86:5728-5732 (1989); and Mullinax et al. , pyoc. Natl. 
ftnad. Sci.. USA . 87:8095-8099 (1990)). Amino acid 
residue sequences for other secretion signal 
polypeptide domains from E. coli useful in this 

30 invention as described in Oliver, Escherichia coli 
and Salmonella Tvphimurium . Neidhard, F.C. (ed.)» 
American Society for Microbiology, Washington, D.C., 
1:56-69 (1987). 

DNA expression control sec[uences comprise a set 
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of DNA expression signals for expressing a structural 
gene product and include both 5* and 3' elements, as 
is well known, operatively linked to the cistron such 
that the cistron is able to express a structural gene 
5 product. The 5' control sequences define a promoter 
for initiating transcription and a ribosome binding 
site operatively lixiked at the 5» terminus of the 
upstream translatable DNA sequence. 

The 3 ' control sec[uences define at least one 

10 termination (stop) codon in frame with and operatively 
linked to the heterologous fusion polypeptide. 

In preferred embodiments, the vector used in this 
invention includes a prokaryotic origin of replication 
or replicon, i.e., a DNA sequence having the ability 

15 to direct autonomous replication and maintenance of 

the recombinant DNA molecule extra chromosomally in a 
prokaryotic host cell, such as a bacterial host cell, 
transformed therewith. Such origins of replication 
are well known in the art. Preferred origins of 

20 replication are those that are efficient in the host 
organism. A preferred host cell is E. coli . A 
preferred strain of E. coli is the supE strain as an 
amber stop codon is translated as glutamine (Q) . For 
use of a vector in E. coli . a preferred origin of 

25 replication is ColEl found in pBR322 and a variety of 
other common plasmids. Also preferred is the pl5A 
origin of replication found on pACYC and its 
derivatives. The ColEl and plSA replicon have been 
extensively utilized in molecular biology, are 

30 available on a variety of plasmids and are described 

at least by Sambrook et al., in "Molecular Cloning: a 
Laboratory Manual", 2nd edition, Cold Spring Harbor 
Laboratory Press, New York (1989). 

The ColEl and pl5A replicons are particularly 
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10 



preferred for use in one embodiment of the present 
invention where two "binary" plasmids are utilized 
because they each have the ability to direct the 
replication of plasmid in cpli while the other 
replicon is present in a second plasmid in the same 
coii cell. In other words, ColEl and pl5A are 
non-interfering replicons that allow the maintenance 
of two plasmids in the same host (see, for example, 
Sambrook et al., supra , at pages 1.3-1.4). This 
feature is particularly important when using binary 
vectors because a single host cell permissive for 
phage replication must support the independent and 
simultaneous replication of two separate vectors, for 
example when a first vector expresses a heavy chain 
15 polypeptide and a second vector expresses a light 
chain polypeptide. 

In addition, those embodiments that include a 
prokaryotic replicon can also include a gene whose 
expression confers a selective advantage, such as drug 
20 resistance, to a bacterial host transformed therewith. 

Typical bacterial drug resistance genes are those that 
confer resistance to ampicillin, tetracycline, 
neomycin/kanamycin or chloramphenicol. Vectors 
typically also contain convenient restriction sites 
25 for insertion of translatable DNA sequences. 

Exemplary vectors are the plasmids pUC8, pUC9, pBR322, 
and pBR329 available from BioRad Laboratories, 
(Richmond, CA) and pPL and pKK223 available from 
Pharmacia, (Piscataway, NJ) . 

AS used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting between 
different genetic environments another nucleic acid to 
which it has been operatively linked. Preferred 
vectors are those capable of autonomous replication 



30 
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and expression of structural gene products present in 
the DNA segments to which they are operatively linked. 
Vectors, therefore, preferably contain the replicons 
and selectable markers described earlier. 
5 As used herein with regard to DNA sequences or 

segments, the phrase "operatively linked" means the 
sequences or segments have been covalently joined, 
preferably by conventional phosphodiester bonds, into 
one strand of DNA, whether in single or double 

10 stranded form. The choice of vector to which a 

transcription unit or a cassette of this invention is 
operatively linked depends directly, as is well known 
in the art, on the functional properties desired, 
e.g. , vector replication and protein expression, and 

15 the host cell to be transformed, these being 

limitations inherent in the art of constructing 
recombinant DNA molecules. 

In a preferred embodiment, a DNA expression 
vector is designed for convenient manipulation in the 

20 form of a filamentous phage particle encapsulating a 

genome according to the teachings of the present 
invention. In this embodiment, a DNA expression 
vector further contains a nucleotide sequence that 
defines a filamentous phage origin of replication such 

25 that the vector, upon presentation of the appropriate 

genetic complementation, can replicate as a 
filamentous phage in single stranded replicative form 
and be packaged into filamentous phage particles. 
This feature provides the ability of the DNA 

30 expression vector to be packaged into phage particles 

for subsequent segregation of the particle, and vector 
contained therein, away from other particles that 
comprise a population of phage particles. 

A filamentous phage origin of replication is a 
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region of the phage genome, as is well known , that 
defines sites for initiation of replication, 
termination of replication and packaging of the 
replicative form produced by replication (see for 
5 example, Rasched et al., Microbiol, Rev, , 50; 401-427 . 

1986; and Horiuchi, J- Mol, Biol, . 188:215-223, 1986). 
A preferred filamentous phage origin of replication 
for use in the present invention is an M13, fl or fd 
phage origin of replication (Short et al., Wucl. Acids 

10 Res, , 16:7583-7600, 1988). 

Preferred DNA expression vectors for cloning and 
expressing a phagemid display protein of this 
invention are the dicistronic plasmid expression 
vectors pC3AP313 and p7EIII described herein. 

15 It is to be understood that, due to the genetic 

code and its attendant redundancies, numerous 
polynucleotide sequences can be designed that encode a 
contemplated heavy or light chain immunoglobulin 
variable region amino acid residue secjuence. Thus, 

20 the invention contemplates such alternate 

polynucleotide sequences incorporating the features of 
the redundancy of the genetic code. 

Insofar as the expression vector for producing a 
human monoclonal antibody of this invention is carried 

25 in a host cell compatible with expression of the 

antibody, the invention contemplates a host cell 
containing a vector or polynucleotide of this 
invention. A preferred host cell is E. coli , as 
described herein. 

30 The preferred phagemid expression vectors in the 

form of plasmids that produce a phagemid display 
protein of this invention were deposited pursuant to 
Budapest Treaty requirements with the American Type 
Culture Collection (ATCC) , Rockville, MD, for 
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co-pending application Serial No. 08/012,556, filed 
February 2, 1993, entitled "Methods for Producing 
Polypeptide Binding sites", the disclosures of which 
are hereby incorporated by reference. The phagemid 
5 expression vectors, pC3AP3l3 and pVEIII, have the 
respective ATCC Accession Numbers 75408 and 75409. 

b. n.Qft of Phaaemid p isnlav Vectors to 
Produce a Metal Binding Site 
A phagemid vector for use herein 
is a recombinant DNA (rDNA) molecule containing a 
nucleotide sequence that codes for and is capable of 
expressing an antibody-derived heterodimeric protein 
on the surface of the phagemid in the form of a 

15 phagemid display protein. Exemplary and preferred 

phagemid vectors are the plasmids pC3AP313 and p7EIII 
described herein and further in Example 1. 

The method for producing a metal binding site in 
a phagemid display protein generally involves (1) 

20 introducing a heavy or light chain V region-coding 

gene of interest into the phagemid display vector; (2) 
introducing a metal binding site into the phagemid 
display protein vector by primer extension with an 
oligonucleotide containing regions of homology to a 

25 CDR of the antibody V region gene and containing 

regions of degeneracy for producing randomized coding 
sequences as described herein, to form a large 
population of display vectors each capable of 
expressing different putative binding sites displayed 

30 on a phagemid surface display protein, (3) expressing 
the display protein and binding site on the surface of 
a filamentous phage particle, and (3) isolating the 
surface-expressed phage particle using affinity 
techniques such as panning of phage particles against 
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a preselected metal cation, thereby isolating one or 
more species of phagemid containing a display protein 
containing a binding site that binds a preselected 
metal cation. 

5 As a further characterization of the produced 

metal binding site, the nucleotide and corresponding 
amino acid residue sequence of the gene coding the 
randomized CDR is determined by nucleic acid 
sequencing. The primary amino acid residue sequence 

10 information provides essential information regarding 
the binding site's reactivity. 

An exemplary preparation of a metal binding site 
in the CDR3 of a heavy chain of an immunoglobulin is 
described in the Examples. The isolation of a 

15 particular vector capable of expressing a metal 

binding site of interest involves the introduction of 
the dicistronic expression vector able to express the 
phagemid display protein into a host cell permissive 
for expression of filamentous phage genes and the 

20 assembly of phage particles. Typically, the host is 

E> coli . Thereafter, a helper phage genome is 
introduced into the host cell containing the phagemid 
expression vector to provide the genetic 
complementation necessary to allow phage particles to 

25 be assembled. 

The resulting host cell is cultured to allow the 
introduced phage genes and display protein genes to be 
expressed, and for phage particles to be assembled and 
shed from the host cell. The shed phage particles are 

3d then harvested (collected) from the host cell culture 
media and screened for desirable metal cation binding 
properties. Typically, the harvested particles are 
"panned" for binding with a preselected metal cation. 
Another preferred method of screening involves the use 
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Of metal chelate affinity chromatography as described 
in the Examples. The strongly binding particles are 
then collected, and individual species of particles 
are clonally isolated and further screened for binding 
5 to the metal cation. Phage which produce a binding 
site of desired metal binding specificity are 
selected. 

With the above-described procedure, the resulting 
phagemids and phagemid display proteins were screened 

10 and selected for binding to either Cu^*, Zn^\ Pb^*, 
Ce'*, Fe'*, or magnetite, and the metal binding site 
containing clones were seguenced to determine the 
binding site sequence. Preferred metal binding site 
sequences were described earlier herein. The 

15 preferred antibody libraries for use in this invention 
are libraries E and F resulting from mutagenesis of 
the phagemid expression vector pC3AP313. Libraries E 
and F, respectively, contained 16 and 10 amino acid 
residue sequence-containing mutagenized CDR3s. These 

20 libraries retain the structurally significant aspartic 
acid residue in the penultimate position (the next to 
last position) within the CDR3. 

5, Preferret^ Metals for Self ^ntina Metal 
25 Rinding site-Spe cific Monoclonal 

Antibodies 

A randomized (mutagenized) CDR-containing antibody 
becomes a metal loantibody when it is complexed with a 
metal cation. Metal cations (cof actor cations) 
30 suitable for complexing with a metal binding protein 
of this invention are any of the transition state 
metals of the periodic table, and the non-transition 
state metals calcium (Ca) , zinc (Zn) , cadmium (Cd), 
cerium (Ce) , mercury (Hg) , strontium (Sr) , and barium 
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(Ba) , Which metals have the capacity to occupy a 
tetrahedral oxidation state and thereby complex with 
said protein through coordinated ligands provided by 
the three contact amino acid residues on the metal 
5 binding protein. Preferred metal cations for use in a 
metalloprotein of this invention are divalent and 
trivalent cations. Preferred divalent cations include 
Cu(II), Zn(II), Ni(II), Co(II), Fe(II), Ag(II) , 
Mn(II), Pb(II) or Cd(II), and more preferably Cu(II), 

10 Zn(II) , Ni(II) and Pb(II). Preferred trivalent 

cations include Ce(III) and Fe(III), the latter in the 
form of iron oxide or iron chloride. 

In preferred embodiments, the metal cation of a 
catalytic metalloantibody is Cu(II), and a preferred 

15 chemical reaction is the non-oxidative hydrolysis 

reactions as well as oxidative reactions. Other 
preferred embodiments include the following: 1) 
Fe(III) results in the oxidative cleavage of proteins 
and could be utilized as a cofactor to produce 

20 antibodies which cleave proteins at defined sequences; 
2) Pb(II) and Ni(III) is utilized to cleave nucleic 
acids; and 3) Lanthanide cerium, specifically, 
Ce(III) hydroxide clusters, is used to catalyze the 
hydrolysis of cAMP. 

25 Metal loantibodies are formed by preparing an 

antibody having a mutagenized CDR resulting in the 
formation of a metal binding site of the present 
invention, and then exposing the metal binding protein 
to a selected metal cation, preferably in a buffered 

30 aqueous medium, for a time sufficient to allow a 

metal-protein coordination complex to form. Preferred 
are the metalloantibodies prepared and described in 
the Examples. 
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D. MPthods frn- select ion of Catalytic Synthetic 
Antibodies 

A preferred einbodiment is general iterative 
strategy using the methods of this invention for 
selection of catalytic synthetic antibodies as shown 
in Figure 2. The first step in this strategy involves 
selection of sequences within the antibody combining 
site which coordinate metal. The preferred regions in 
which mutagenesis is performed are the CDRs. 
Selection is most easily achieved by immobilization of 
the target. Preferred for use is the immobilization 
of a metal ion utilized a support derivatized with a 
chelating agent, such iminodiacetic acid supports are 
suitable for the immobilization of many metals. 
Furthermore, this chelating agent occupies a limited 
number of coordination sites on the metal and provides 
the metal with a number of vacant sites (HjO) which 
are utilized to select coordinating antibodies from 
the library. Preferred alternatives to an inert 
support are synthetic transition-state analogs that 
immobilize the metal for selection. 

The second step of the method for selecting 
synthetic catalytic antibodies involves the 
diversification of the library either by chain 
shuffling or mutagenesis of another CDR. From this 
pool of antibodies which coordinate metal, a second 
selection step is then utilized to optimize binding to 
a hapten which may also provide a site to coordinate 
metal although coordination to the substrate may not 
be necessary so long as the metal participates in the 
chemistry. For example, a Zn atom coordinated to one 
CDR may be utilized to deliver a hydroxide ion to the 
carbonyl carbon of a peptide bound to another region 
of the antibody molecule. The light chain CDR3 may be 
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the most desirable region to install metal binding as 
the extended heavy chain CDR3 could be utilized to 
provide the hapten binding pocket, similar to that 
seen with known copper containing proteins that 
coordinate copper within a loop of a Greek key folded 
domain as described by Adam, Adv. Prot. Chem. , 
42:145-197 (1991). Antibodies exhibit this same fold. 
To that end, a naturally occurring copper-binding 
antibody derived from a myeloma patient has been 
previously described by Baker et al. , J. Biol. Chem. , 
253:8444-8451 (1978). A mercury-binding antibody has 
also recently been described by Wylie et al., Prpc 
Natl. Acad. Sci.. USA . 82:4104-4108 (1992). 

Preferred metal binding sites in the antibody 
15 combining site useful in catalysis include the 

following: 1) Cu(II) has been utilized to catalyze 
nonoxidative hydrolysis reactions as well as oxidative 
reactions as described by Chiou, J. Biochem. , 
94:1259-1267 (1983) and Fife et al., J. Am. Chemt 
20 Soc. . 105:1638-1642 (1983); 2) Fe(III) is 

particularly interesting for the oxidative cleavage of 
proteins and could be utilized as a cofactor to 
produce antibodies which cleave proteins at defined 
sequences as described by Rana et al., J. Am. Chem. 
25 Soc. . 112:2457-2458 (1990); 3) Pb(II) and Ni (III) 

have been utilized to cleave RNA and DNA, 
respectively, as described by Brown et al., Nat^upe , 
303:543-546 (1983) and Cheng et al., Angew. Chem. Int. 
Ed. Enal. . 32:277-278 (1993); and 4) Cerium(III) 
30 hydroxide clusters have recently been demonstrated to 
catalyze the hydrolysis of cAMP with a lo" fold rate 
enhancement as described by Sumacka et al., J. Chem. 
Rne. Chem. Commun. . 1707-1708 (1992) . Cerium, thus, 
is a particularly interesting metal to incorporate 



into antibodies in an effort to produce artificial 
restriction enzymes. Furthermore, cerium. serves as an 
example of a catalytically interesting metal for which 
there is no known naturally occurring protein from 
which to design a metal binding site. 

E. Metal Bint^ina Site Compositions and Uses 
Thereof 

1. Metal Binding Si ^r.^ Monoclonal 
Antibodies 

The present invention describes, in one 
embodiment, human monoclonal antibodies that contain a 
binding site as described herein and that bind 
specifically to a preselected target molecule. The 
invention also describes cell lines that produce the 
antibodies, methods for producing the cell lines, and 
methods for producing the human monoclonal antibodies. 

Insofar as a display protein of this invention on 
a phagemid particle is, in preferred embodiments, a 
fusion protein between an immunoglobulin heavy or 
light chain and a filamentous phage membrane anchor, 
it is to be understood that the display protein is, in 
effect, an engineered immunoglobulin heavy or light 
chain into which a binding site has been introduced by 
the methods of this invention. Furthermore, in many 
embodiments, the expression of the display protein is 
prepared on the phagemid surface as a heterodimer 
formed between immunoglobulin heavy and light chain 
polypeptides, with one or the other being a fusion 
protein with the membrane anchor. Thus, where the 
heavy chain is used as the fusion protein, a display 
protein in preferred embodiments comprises a Fab 
fragment having an anchored heavy chain associated 
with a light chain. 
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The preparation of cell lines producing 
monoclonal antibodies of the invention is described in 
great detail further herein, and can be accomplished 
using the phagemid vector mutagenesis methods 
5 described herein, and using routine screening 
techniques which permit determination of the 
elementary binding patterns of the monoclonal antibody 
of interest indicative that the binding site has been 
produced. Thus, if a human monoclonal antibody being 
10 tested binds to the preselected target molecule, then 
the human monoclonal antibody being tested and the 
human monoclonal antibody produced by the phagemid in 
a host cell of the invention are considered 
equivalent. 

15 It is also possible to determine, without undue 

experimentation, if a human monoclonal antibody has 
the same (i.e., equivalent) specificity as a human 
monoclonal antibody of this invention by ascertaining 
whether the former prevents the latter from binding to 

20 a preselected target molecule. If the human 

monoclonal antibody being tested competes with the 
human monoclonal antibody of the invention, as shown 
by a decrease in binding by the human monoclonal 
antibody of the invention in standard competition 

25 assays for binding to the target molecule when present 
in the solid phase, then it is determined that the two 
monoclonal antibodies bind to the same, or a closely 
related, epitope. 

Still another way to determine whether a human 

30 monoclonal antibody has the specificity of a human 
monoclonal antibody of the invention is to 
pre-incubate the hximan monoclonal antibody of the 
invention with the target molecule with which it is 
normally reactive, and then add the human monoclonal 
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antibody being tested to determine if the human 
monoclonal antibody being tested is inhibited in its 
ability to bind the target molecule. If the human 
monoclonal antibody being tested is inhibited then, in 
5 all likelihood, it has the same, or functionally 

equivalent, epitopic specificity as the monoclonal 
antibody of the invention. 

The immunospecificity of an antibody, its target 
molecule binding capacity, and the attendant affinity 
10 the antibody exhibits for the epitope, are defined by 
the epitope with which the antibody immunoreacts . The 
epitope specificity is defined at least in part by the 
amino acid residue sequence of the variable region of 
the heavy chain of the immunoglobulin the antibody, 
15 and in part by the light chain variable region amino 
acid residue sequence. 

Particularly preferred are human monoclonal 
antibodies having the binding specificity of the 
monoclonal antibodies produced in E. coli 
20 microorganisms or produced by plasmid vectors that are 
deposited with the ATCC, as described further herein. 

Use of the term "having the binding specificity 
of indicates that equivalent monoclonal antibodies 
exhibit the same or similar immunoreaction (binding) 
25 properties and compete for binding to a preselected 

target molecule. 

The term "conservative variation or substitution" 
as used herein denotes the replacement of an amino 
acid residue by another, biologically similar residue. 
30 Examples of conservative variations include the 

substitution of one hydrophobic residue such as 
isoleucine, valine, leucine or methionine for another, 
or the substitution of one polar residue for another, 
such as the substitution of arginine for lysine, 
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glutamic for aspartic acids, or glutamine for 
asparagine, and the like. 

The term "conservative variation" also includes 
the use of a substituted amino acid in place of an 
5 unsubstituted parent amino acid provided that 

antibodies having the substituted polypeptide also 
bind to the preselected target molecule. Analogously, 
another preferred embodiment of the invention relates 
to polynucleotides which encode the above noted heavy 
10 and/or light chain polypeptides and to polynucleotide 
sequences which are complementary to these 
polynucleotide sequences. Complementary 
polynucleotide sequences include those sequences which 
hybridize to the polynucleotide sequences of the 
15 invention under stringent hybridization conditions. 

Human monoclonal antibodies offer particular 
advantages over murine monoclonal antibodies, 
particularly insofar as they can be used 
therapeutically in humans. Specifically, human 
20 antibodies are not cleared from the circulation as 

rapidly as "foreign" antigens, and do not activate the 
immune system in the same manner as foreign antigens 
and foreign antibodies. 

The invention contemplates, in one esibodiment, a 
25 monoclonal antibody expressed on the surface of 

bacteriophage as a phagemid display protein of this 
invention produced by the present methods. 

In another preferred embodiment, the invention 
contemplates a truncated immunoglobulin molecule 
30 comprising a Fab fragment derived from a human 
monoclonal antibody of this invention. The Fab 
fragment, lacking Fc receptor, is soluble, and affords 
therapeutic advantages in serum half life, and 
diagnostic advantages in modes of using the soluble 
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Fab fragment; The preparation of a soluble Fab 
fragment is generally knovm in the immunological arts 
and can be accomplished by a variety of methods. A 
preferred method of producing a soluble Fab fragment 

5 is described herein. 

In one embodiment, the invention describes a 
class of human monoclonal antibodies which immunoreact 
with metal cations including Cu(II), Zn(II) , Ni(II), 
Pb(II), Ce(III), Fe(III), iron oxide and the like. 
10 The antibodies were produced by the present methods in 
which a metal binding sites in a CDR3 of an 
immunoglobulin heavy chain variable domain were 
obtained as a result of the mutagenesis of the CDR3 
followed by selection oh metals, as described in the 

15 Examples. 

The resulting population of display vectors 
containing a metal binding site in the display protein 
were screened first for binding activity by metal 
chelate affinity chromatography or by panning against 

20 the target metal as described in the Examples. 

Thereafter, selected members of the population of 
display vectors with metal binding activity were 
isolated, soluble Fab expression was engineered into 
each of the selected vectors as described in the 

25 Examples, and the resulting soluble Fab were 

expressed, and screened for binding or other 
functional activities. 

The binding properties of the soluble purified 
Fabs were then analyzed by ELISA against metal ions 

30 immobilized by means of a support derivatives with a 

chelating agent, such as EDTA. A preferred support, 
for use in this invention is iminodiacetic acid (IDA) 
complexed with bovine serum albumin (BSA) and the 
preselected metal. The chelating agent functions by 



occupying a limited number of coordination sites on 
the metal and provides the metal with a number of 
vacant sites that are utilized to select coordinating 
antibodies from a phagemid library. A preferred 
alternative to an inert support is the use of 
synthetic transition-state analogs, the design of 
which is familiar to one of ordinary skill in the art, 
that are used to immobilize the metal for selection. 

Anti-metal human monoclonal antibodies were 
identified which have the desirable property of high 
affinity binding to selected metals with diversity 
within a class of metal binders. For example, three 
of the copper-reactive antibodies exhibited unique 
binding characteristics, as described in the Examples. 
Data presented in the Examples illustrates anti-metal 
human monoclonal antibodies produced by the methods of 
this invention that are potent metal binders as 
compared to those anti-metal antibodies produced by 
conventional means. 

Particularly preferred human monoclonal 
anti-metal antibodies are those having the 
immunoreaction (binding) specificity of a monoclonal 
antibody having heavy and light chain immunoglobulin 
variable region amino acid residue sequences in pairs 
(H:L) where the light chain is the light chain encoded 
by the plasmid vector pC3AP313 described herein, and 
referred to as light chain 313, or L313, and the heavy 
chain has one of the recited binding sites, and 
conservative substitutions thereof. The designation 
of a human monoclonal antibody with a colon, e.g., 
H:L313 is to connote a H:L pair formed by the heavy 
and light chain, respectively, in which the light 
chain is the preferred L313 light chain described 
herein . 
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2. raiialvtir Monoclo nal Antibodies with 
M«a1-a1 Bind ing Sites 
A particularly preferred embodiment 
involves using the anti-metal antibodies of the 
present invention to produce catalytic antibodies, 
i.e., antibodies having a metal binding site which can 
participate in promoting catalysis. Thus, the 
invention contemplates producing a metal leant ibody 
which has the capacity of promoting a predetermined 
chemical reaction, i.e., a catalytic metalloantibody. 
catalytic antibodies have been described by Tramontane 
et al., science, 234:1566-1570 (1986); Pollack et al., 
science . 234:1570-1573 (1986); Janda et al.. Science , 
241:1188-1191 (1988); Janda et al.. Science , 244: 
15 437-440 (1989) , and in United States Patent No. 

4,659,567, No. 4,900,674, No. 5,030,717, No. 5,079,152 
and NO. 5,126,258, the disclosures of which are hereby 
incorporated by reference. In this embodiment, a 
preselected antigen with which the antibody combining 
site immunoreacts is also a substrate for a reaction 
that is promoted by the metalloantibody. 



3. Mfttal Ch'alation t^y Monoclonal 

AT^tibodipg Contai ning Metal Binding 

25 Sites 

A metal binding protein can function as 
a metal ion chelator. Thus, the metal binding 
antibodies of this invention are suitable for use as 
metal ion chelators. One of the well-characterized 

30 biological chelator of metals is metallothionein which 
is a ubiquitous, low-molecular weight, metal-binding 
protein that participates in metal metabolism and 
detoxification as described by Morelock et al., in 
"Metallothionein II", Experientia Supplementum 
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(Birkhaeuser, Basel), 52:247-253 (1987). 
Metallothionein has been shown to bind metals, such as 
technetium or indium, useful for targeted 
radiodiagnosis or therapy. 

Generally, for use in radioimmunoimaging, 
chelators are chemically conjugated to antibodies. 
The disadvantage of the conjugation approach is that 
the process may result in decreased immunoreactivity 
of the conjugated antibody or lead to increased 
protein aggregation. To overcome these deleterious 
effects, chimeric antibodies have been produced in 
which the metallothionein chelator has been engineered 
into the context of an immunoreactive antibody by 
means of recombinant DNA techniques as described by 
Das et al., Natl. a^^H. gci.. USA. 89:9749-9753 

(1992) and sawyer et al., Proc. Natl. Acad. Sci.. 
USA, 89:9754-9758 (1992), the disclosures of which are 
hereby incorporated by reference. The resultant 
chimeric antibody was shown to efficiently target to 
the specific antigen while allowing for detection of 
the metallothionein-chelated radioimaging material. 

The metal-binding antibodies of this invention 
provide further advantages in that instead of creating 
chimeric antibodies, any number of chelating 
antibodies with unique specificities and binding 
characteristics can be selected by screening against 
specific target molecules, such as radioimaging 
reagents. Another embodiment of this invention is the 
preparation of such metal-chelating antibodies that 
exhibit immunoreactivity against a preselected 
antigen, such as a hapten or protein, as described 
herein for the selection of catalytic antibodies and 
as shown in Figure 2. 
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F. Therapeutic and Di agnostic Compositions 
The present invention contemplates 
therapeutic and diagnostic compositions useful for 
practicing the therapeutic or diagnostic methods 
5 described herein. Therapeutic compositions of the 

present invention contain a physiologically tolerable 
carrier together with at least one species of human 
monoclonal antibody as described herein, dissolved or 
dispersed therein as an active ingredient. In a 

10 preferred embodiment, the therapeutic composition is 
not immunogenic when administered to a human patient 
for therapeutic purposes. 

As used herein, the terms "pharmaceutically 
acceptable", "physiologically tolerable" and 

15 grammatical variations thereof, as they refer to 

compositions, carriers, diluents and reagents, are 
used interchangeably and represent that the materials 
are capable of administration to or upon a human 
without the production of undesirable physiological 

20 effects such as nausea, dizziness, gastric upset and 
the like. 

The preparation of a pharmacological composition 
that contains active ingredients dissolved or 
dispersed therein is well understood in the art. 

25 Typically such compositions are prepared as sterile 

injectables either as liquid solutions or suspensions, 
aqueous or non-aqueous, however, solid forms suitable 
for solution, or suspensions, in liquid prior to use 
can also be prepared. The preparation can also be 

30 emulsified. Thus, an antibody molecule-containing 

composition can take the form of solutions, 
suspensions, tablets, capsules, sustained release 
formulations or powders, or other compositional forms. 
The active ingredient can be mixed with 
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excipients which are pharmaceutically acceptable and 
compatible with the active ingredient and in amounts 
suitable for use in the therapeutic methods described 
herein. Suitable excipients are, for example, water, 
5 saline, dextrose, glycerol, ethanol or the like and 
combinations thereof. In addition, if desired, the 
composition can contain minor amounts of auxiliary 
substances such as wetting or emulsifying agents, pH 
buffering agents and the like which enhance the 
10 effectiveness of the active ingredient. 

The composition of the present invention can 
include pharmaceutically acceptable salts of the 
components therein. Pharmaceutically acceptable salts 
include the acid addition salts (formed with the free 
15 amino groups of the polypeptide) that are formed with 
inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids as acetic, 
tartaric, mandelic and the like. Salts formed with 
the free carboxyl groups can also be derived from 
inorganic bases such as, for example, sodium, 
potassium, ammonium, calcium or ferric hydroxides, and 
such organic bases as isopropylamine, tr imethylamine , 
2-ethylamino ethanol, histidine, procaine and the 
like. 

Physiologically tolerable carriers are well known 
in the art. Exemplary of liquid carriers are sterile 
aqueous solutions that contain no materials in 
addition to the active ingredients and water, or 
contain a buffer such as sodium phosphate at 
physiological pH value, physiological saline or both, 
such as phosphate-buffered saline. Still further, 
aqueous carriers can contain more than one buffer 
salt, as well as salts such as sodium and potassium 
chlorides, dextrose, propylene glycol, polyethylene 



20 



25 
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glycol, and other solutes. 

Liquid compositions can also contain liquid 
phases in addition to and to the exclusion of water. 
Exemplary of such additional liquid phases are 
5 glycerin, vegetable oils such as cottonseed oil, 

organic esters such as ethyl oleate, and water-oil 
emulsions . 

A composition contains a human monoclonal 
antibody of the present invention, typically in an 

10 amount of at least 0.1 weight percent of antibody per 
weight of total composition. A weight percent is a 
ratio by weight of antibody to total composition. 
Thus, for example, 0.1 weight percent is 0.1 grams of 
antibody per 100 grams of total composition. 

15 Preferably, an antibody-containing composition 

typically contains about 10 micrograms (ug) per 
milliliter (ml) to about lOO milligrams (mg) per ml of 
antibody as active ingredient per volume of 
composition, and more preferably contains about 1 

20 rog/ml to about 10 mg/ml (i.e., about 0.1 to 1 weight 
percent) . 

G. Therapeu'fcic Methods 

In view of the benefit of using htiman 

25 monoclonal antibodies in vivo in human patients, the 

presently described antibodies are particularly well 
suited for in vivo use as a therapeutic reagent for 
blocking or inhibiting the function of the target 
molecule which the antibody binds or for catalyzing 

30 therapeutically relevant reaction involving a target 

molecule. The method comprises contacting a sample 
believed to contain the target molecule with a 
composition comprising an effective amount of a human 
monoclonal antibody of this invention which binds the 
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target molecule. 

For in vivo modalities, the method comprises 
administering to the patient an effective amount of a 
physiologically tolerable composition containing a 
5 human monoclonal antibody of the invention. 

The dosage ranges for the administration of the 
monoclonal antibodies of the invention are those large 
enough to produce the desired effect. The dosage 
should not be so large as to cause adverse side 

10 effects, such as hyperviscosity syndromes, pulmonary 
edema, congestive heart failure, and the like. 
Generally, the dosage will vary with the age, 
condition, sex and extent of the disease in the 
patient and can be determined by one of skill in the 

15 art . 

The dosage can be adjusted by the individual 
physician in the event of any complication. 

An effective amount of an antibody of this 
invention is typically an amount of antibody such that 
20 When administered in a physiologically tolerable 
composition is sufficient to achieve a plasma 
concentration of from about 0.1 ug/ml to about 100 
ug/ml, preferably from about 1 ug/ml to about 5 ug/ml, 
and usually about 5 ug/ml. stated differently, the 
25 dosage can vary from about 0.1 mg/kg to about 300 

mg/kg, preferably from about 0.2 mg/kg to about 200 
mg/kg, most preferably from about 0.5 mg/kg to about 
20 mg/kg, in one or more dose administrations daily, 
for one or several days. 

The human monoclonal antibodies of the invention 
can be administered parenterally by injection or by 
gradual infusion over time. Although the target 
molecule can typically be accessed in the body by 
systemic administration and therefore most often 
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treated by intravenous administration of compositions, 
other tissues and delivery means are contemplated 
where there is a likelihood that the tissue targeted 
contains the target molecule. Thus, human monoclonal 
5 antibodies of the invention can be administered 

intravenously, intraperitoneal ly, intramuscularly, 
subcutaneously, intracavity, transdermally , and can be 
delivered by peristaltic means. 

The compositions containing a human monoclonal 

10 antibody of this invention are conventionally 

administered intravenously, as by injection of a unit 
dose, for example. The term "unit dose" when used in 
reference to a therapeutic composition of the present 
invention refers to physically discrete units suitable 

15 as unitary dosage for the subject, each unit 

containing a predetermined quantity of active material 
calculated to produce the desired effect in 
association with the required diluent; i.e., carrier, 
or vehicle-. 

20 The compositions are administered in a manner 

compatible with the dosage formulation, and in a 
effective amount. The quantity to be administered 
depends on the subject to be treated, capacity of the 
subject's system to utilize the active ingredient, and 
25 degree of effect desired. Precise amounts of active 

ingredient required to be administered depend on the 
judgement of the practitioner and are peculiar to each 
individual. However, suitable dosage ranges for 
systemic application are disclosed herein and depend 
on the route of administration. Suitable regimes for 
administration are also variable, but are typified by 
an initial administration followed by repeated doses 
at one or more hour intervals by a subsequent 
injection or other administration. Alternatively, 
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continuous intravenous infusion sufficient to maintain 
concentrations in the blood in the ranges specified 
for in vivo therapies or methods are contemplated. 

Examples 

The following examples relating to this invention 
are illustrative and should not, of course, be 
construed as specifically limiting the invention. 
Moreover, such variations of the invention, now known 
or later developed, which would be within the purview 
of one skilled in the art are to be considered to fall 
within the scope of the present invention hereinafter 
claimed . 

1. Preparation of Synthetic M etal Binding Sites 

Within the Heavy Chain CDR3 Domain of a Phaaemid 
Fab Display Protein Produc ed bv a Dieistronic 
Expression Vector 

The binding sites of this invention as described 
herein were incorporated into a complementary 
determining region (CDR) in the antibody heavy chain 
of a selected phagemid expression vector. The term 
"binding site" as defined herein is any region of a 
protein or polypeptide that participates in protein - 
target molecule interactions. The CDR3 encoding a 
heavy chain Fab was randomized by PGR mutagenesis 
followed by overlap polymerase chain reaction (PCR) 
amplification to create nucleotide sequences that 
encode amino acid residue sequences that bind selected 
metals oxides or metal salts, specifically, magnetite 
(FejOJ, copper, zinc, lead, cerium and iron. These 
residues were randomly amplified into the CDR3 of the 
heavy chain as described below in either the s\irface 
Fab display phagemid expression vectors, pC3JWP313 or 
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P7EIII. selection for binding to metals was then 
performed as described in Example 3. 

Both of the phagcmid expression vectors, pC3AP313 
and P7EIII, used in this invention were deposited for 
5 co-pending application, Serial Number 08/012,566 filed 
February 2, 1993, entitled "Methods for Producing 
Polypeptide Binding Sites" . The deposited expression 
vectors, pC3AP313 and p7EIII, have been assigned the 
ATCC accession numbers 75408 and 75409. The p7EIII 
10 expression vector (also referred to as pC3-TT7E) , 
containing the sequences for encoding a human 
anti-tetanus toxin antibody where the heavy chain was 
fused to the coat protein encoded by bacteriophage 
gene 3, has been described by Barbas et al., Prec^ 
15 r^--^ US^- 88:7978-7982 (1991), the 

disclosure of which is hereby incorporated by 
reference. The vector contains the expression 
elements present in pC3AP313 as described in Example 
2, The resultant metal binding site-encoding PGR 
20 products amplified from either pC3AP313 or P7EIII were 
purified, digested and ligated into pC3AP313 for 
expression as described in Example 2. 

For preparing PGR products that encode binding 
sites that exhibited specificity for metals, two 
25 separate PGR reactions were performed followed by 

overlap PGR. In the first PGR amplification reaction, 
the 5« end of the heavy chain beginning at framework 1 
and extending to the 3 • end of framework 3 was 
amplified. In the second PGR amplification reaction, 
30 the GDR3 was mutagenized to produce sequences that 
encode metal binding sites. The amino acid residue 
position of the heavy chain GDR3 corresponded to Kabat 
numbers 94-103. This was accomplished through the use 
of a pool of oligonucleotide primers synthesized with 



a degenerate region sandwiched between and contiguous 
with conserved framework 3 and 4 region sequences. 
Degenerate oligonucleotide primers were designed for 
encoding amino acid residue sequences of 5, 10 or 16 
amino acid residues in length comprising the resultant 
CDR3. 

The amplification products resulting from the 
second PCR, each having a randomized CDR3 consisting 
of various preselected lengths as described below, 
began at the 3' end of framework 3 and extended to the 
3« end of the CHI region. The pool of degenerate 
oligonucleotide primers were designed to result in the 
amplification of products having a 5' end that was 
complementary to and overlapped with the 3' end of the 
products of the first PCR reaction product. 
Thereafter, the two separate PCR reaction products 
were pooled and subjected to a third PCR reaction in 
which the overlapping region between the two products 
was extended to result in a complete heavy chain from 
framework 1 through framework 4 having a randomized 
CDR3. 

A. PCR Amplif-i cations With D7E TTT as Template 
For amplifications where the p7EIII surface 
display expression vector was selected for the PCR 
template, the first PCR reaction resulted in the 
amplification of the region of the heavy chain 
fragment in the p7EIII phagemid beginning at framework 
region 1 and extending to the end of framework region 
3 which was located 5' to CDR3, resulting in a total 
length of approximately 400 base pairs in length. To 
amplify this region, the following primer pairs were 
used. The 5' coding oligonucleotide primer, FPX3, 
having the nucleotide sequence 5'- 
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GCAATTAACCCTCACTAAAGGG-3 • (SEQ ID NO 55), hybridized 
to the noncoding strand of the heavy chain 
corresponding to the region 5* of and including the 
beginning of framework l. The 3' noncoding 
5 oligonucleotide primer, B7EFR3, having the nucleotide 
sequence 5 • -TCTCGCACAATAATACACGGC-3 • (SEQ ID NO 56) , 
hybridized to the coding strand of the heavy chain 
corresponding to the 3* end of the framework 3 region. 
The oligonucleotide primers were synthesized by Operon 
10 Technologies, Alameda, CA. 

The PGR reaction was performed in an 100 
microliter (ul) reaction containing 1 microgram (ug) 
of each of oligonucleotide primers FTXS and B7EFR3, 8 
ul 2.5 millimolar (mM) dNTP's (dATP, dCTP, dGTP, 
15 dTTP) , 1 ul Tag polymerase (Perkin-Elmer Corp., 

Norwalk, CT) , 10 nanogram (ng) of template p7EIII, and 
10 ul of lOX PGR buffer purchased commercially 
(Perkin-Elmer Gorp.). Thirty-five rounds of PGR 
amplification in a Perkin-Elmer Cetus 9600 GeneAmp PGR 
20 System thermocycler were then performed. The 

amplification cycle consisted of denaturing at 94 
degrees C (94G) for one minute, annealing at 47C for 
one minute, followed by extension at 72G for two 
minutes. To obtain sufficient quantities of 
25 amplification product, 15 identical PGR reactions were 

performed and the products were then pooled. 

The resultant PGR amplification products 
containing the sequence for encoding an RGD peptide 
were then gel purified on a 1.5% agarose gel using 
30 standard electroelution techniques as described in 

••Molecular Cloning; A Laboratory Manual", Sambrook et 
al., eds., cold Spring Harbor, NY (1989). Briefly, 
after gel electrophoresis of the digested PGR 
amplified Fab-display encoding synthetic binding 
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Sites, the region of the gel containing the DNA 
fragments of predetermined size were excised, 
electroeluted into a dialysis membrane, ethanol 
precipitated and resuspended in buffer containing 10 
5 mM Tris-HCl [Tris(hydroxymethyl)aminoinethane- 
hydrochloride] at pH 7.5 and 1 mM EDTA 
(ethylenediaminetetraacetic acid) to a final 
concentration of 50 ng/milliliter (ng/ml) . 

The resultant purified PGR amplification products 
10 were then used in an overlap extension PGR reaction 

with the products of the second PGR reaction, both as 
described below, to recombine the two products into 
reconstructed heavy chains containing mutagenized 
CDR3S. 

15 The second PGR reaction resulted in the 

amplification of the heavy chain from the 3' end of 
framework region 3 extending to the end of CHI region 
which was approximately 390 base pairs (bp) in length. 
To amplify this region for encoding 5 random amino 

20 acid residues comprising the GDR3, the following 

primer pairs were used. The 5- coding oligonucleotide 
primer pool, designated HGDR5, had the nucleotide 
sequence represented by the formula, 

5 • -GTGTATTATTGTGGGAGANNSNNSNNSNNSNNSTGGGGCCAAGGGACCAGG 
25 -3' (SEQ ID NO 50), Where N is A, C, G, or T and S is 
either C or G. The 5* end of the primer pool is 
complementary to the 3 • end of framework 3 represented 
by the complementary nucleotide sequence of the 
oligonucleotide primer B73FR3 and the 3' end of the 
30 primer pool is complementary to the 5« end of 

framework 4. The region between the two specified 
ends of the primer pool is represented by a 15-mer 
degeneracy of 5 NNS triplets that ultimately encodes a 
diverse population of mutagenized CDR3s of 5 amino 
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acid residues in length. The 3' noncoding 
oligonucleotide primer, CGlz, as described by Persson 
et al., P^or., Natl. i^np^A. Sci.. USA. 88:2432-2436 
(1991), having the nucleotide sequence 
5 5 1 -GCATGTACTAGTTTTGTCACAAGATTTGGG-3 ' (SEQ ID NO 57), 

hybridized to the coding strand of the heavy chain 
corresponding to the 3' end of CHi. The second PGR 
reaction was performed on the p7EIII in a 100 ul 
reaction as described above containing l ug of each of 
10 oligonucleotide primers HCDR5 and CGlz. The resultant 
PGR amplification products were then gel purified as 

described above. 

One hundred nanograms of gel purified products 
from the first and second PGR reactions were then 
15 admixed with 1 ug each of FTX3 and CGlz 

oligonucleotide primers as a primer pair in a final 
PGR reaction to form a complete heavy chain fragment 
by overlap extension. The PGR reaction admixture also 
contained 10 ul lOX PGR buffer, l ul Tag polymerase 
20 and 8 ul 2.5 mM dNTP's as described above. The PGR 
reaction was performed as previously described. To 
obtain sufficient quantities of amplification product, 
15 identical PGR reactions were performed as described 
above. The resulting heavy chain fragments, beginning 
25 at framework 1 and extending to the end of CHI and 
having a randomly mutagenized CDR3 for encoding 5 
amino acid residues, were approximately 790 base pairs 
in length. The heavy chain fragment amplification 
products from the 15 reactions were first pooled and 
then gel purified as described above prior to their 
incorporation into the pG3AP313 surface display 
phagemid expression vector to form a library as 
described in Example 2. The resulting phagemid 
library was designated 7-5. 



30 



In scidition to randomizing the CDR3 in p7EIII for 
expressing 5 amino acid residues, PGR amplifications 
were performed for expressing a CDR3 containing 10 
amino acid residues. Two separate PGR amplifications 
were performed as described above with the only 
exception being that, in the second reaction, the 5' 
coding degenerate primer, designated HCDRIO, used to 
encode 10 random amino acid residues in the ca5R3 had 
the formula: 

5 • -GTGTATTATTGTGCGAGANNSNNSNNSNNSNNSNNSNNSNNSNNSNNSTGG 
GGCCAAGGGACCACG-3 • (SEQ ID NO 51), where N is A, C, G 
or T and S is C or G. The resultant products were 
pooled and purified as described above prior to 
insertion into pC3AP313 surface display phagemid 
expression vector to form a library as described in 
Example 2. The resulting phagemid library was 
designated 7-10. 

B. VCR Amplifications With pg3AP313 as Template 
For amplifications where the pC3AP313 
surface display expression vector was selected for the 
PGR template, the first PGR reaction resulted in the 
amplification of the region of the heavy chain 
fragment in the pC3AP313 phagemid beginning at 
framework region 1 and extending to the end of 
framework region 3 which was located 5* to CDR3, 
resulting in a total length of approximately 400 base 
pairs in length. The degenerate primer designed for 
use with the pC3AP313 template resulted in the 
retention of a conserved aspartic acid residue in the 
next to last position in the CDR3. The retention of 
the aspartic acid residue in this position is 
preferred for use in this invention as the expressed 
proteins containing this residue exhibit high affinity 



binding characteristics as described in Example 3. 

To amplify the 5' end of the heavy chain from 
framework 1 to the end of framework 3, the following 
primer pairs were used. The 5' coding oligonucleotide 
primer, FTX3, having the nucleotide sequence 
5 • -GCAATTAACCCTCACTAAAGGG-3 • (SEQ ID NO 55), 
hybridized to the noncoding strand of the heavy chain 
corresponding to the region 5' of and including the 
beginning of framework 1. The 3' noncoding 
oligonucleotide primer, BFR3U, having the nucleotide 
sequence 5 ' -TCTCGCACAGTAATACACGGCCGT-3 ' (SEQ ID NO 
58), hybridized to the coding strand of the heavy 
chain corresponding to the 3« end of the framework 3 
region. The oligonucleotide primers were synthesized 
by Operon Technologies. 

The PGR reaction was performed in an lOO ul 
reaction as described above with the exception that 10 
ng of template pC3AP313 were used. The resultant PCR 
amplification products were then gel purified as 
described above and used in an overlap extension PCR 
reaction with the products of the second PCR reaction, 
both as described below, to recombine the two products 
into reconstructed heavy chains containing mutagenized 
CDR3S. 

The second PCR reaction resulted in the 
amplification of the heavy chain from the 3' end of 
framework region 3 extending to the end of CHI region 
which was approximately 390 base pairs in length. To 
amplify this region for encoding a 5 random amino acid 
residue sequence having the formula NH2-XXXDX-COOH 
(SEQ ID NO 59) , where X is Xaa, any amino acid and D 
is aspartic acid, in the CDR3 , the following primer 
pairs were used. The 5« coding oligonucleotide primer 
pool, designated HCDRD5, had the nucleotide sequence 



represented by the formula, 

5 • -GCCGTGTATTACTGTGCGAGANNKNNKNNKGACNNKTGGGGCCAAGGGACC 
ACGGTC-3' (SEQ ID NO 54), where N can be A, C, G, or T 
and K is either G or T. The 5 • end of the primer pool 
is complementary to the 3* end of framework 3 
represented by the complementary nucleotide sequence 
of the oligonucleotide primer BFR3U and the 3' end of 
the primer pool is complementary to the 5' end of 
framework 4. The region between the two specified 
ends of the primer pool is represented by a 12-mer 
degeneracy of 4 NNK triplets plus a sequence encoding 
a conserved aspartic acid residue one position from 
the end of the CDR3. The 3' noncoding oligonucleotide 
primer, R3B, having the nucleotide sequence 
5«-TTGATATTCACAAACGAATGG-3' (SEQ ID NO 60), hybridized 
to the coding strand of the heavy chain corresponding 
to the 3* end of CHI. 

The sequence NNK represents the coding strand 
sequence having the complementary sequence NNM in the 
primer as read from the 3' to 5' direction. Thus, in 
the primer as listed below the noncoding strand 
sequence is MNN as read in the 5' to 3' direction. 
The coding triplet sequence NNK was designed to 
prevent the production of deleterious stop codons. 
The only stop codon that could result from the 
expression of NNK would be an amber mutation that is 
suppressed when the phagemid is expressed an 
amber-suppressing host cell, preferably E. coli supE 
strain . 

The second PGR reaction was thus performed on the 
pC3AP313 in an 100 ul reaction as described above 
containing 1 ug of each of oligonucleotide primers 
HCDRD5 and R3B. The resultant PCR products encoded a 
diverse population of mutagenized CDR3s of 5 amino 
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acid residues in length with a conserved aspartic acid 
residue in the fourth amino acid residue position in 
the CDR3. The products were then gel purified as 
described above. 
5 One hundred nanograms of gel purified products 

from the first and second PGR reactions were then 
admixed with 1 ug each of FTX3 and R3B oligonucleotide 
primers as a primer pair in a final PGR reaction to 
form a complete heavy chain fragment by overlap 

10 extension. The PGR reaction admixture also contained 
10 ul lOX PGR buffer, 1 ul Taq polymerase and 8 ul 2.5 
mM dNTP's as described above. The PGR reaction was 
performed as previously described. 

To obtain sufficient quantities of amplification 

15 product, 15 identical PGR reactions were performed. 
The resulting heavy chain fragments beginning at 
framework 1 and extending to the end of GHl and having 
a randomly mutagenized GDR3 for encoding 5 amino acid 
residues with a conserved aspartic acid residue were 

20 approximately 790 base pairs in length. The heavy 
chain fragment amplification products from the 15 
reactions were first pooled and then gel purified as 
described above prior to their incorporation into the 
pG3AP313 surface display phagemid expression vector to 

25 form a library as described in Example 2. The 

resulting phagemid library was designated library G. 

In addition to randomizing the GDR3 in pG3AP313 
for expressing 5 amino acid residues, PGR 
amplifications were performed for expressing a GDR3 

30 containing 10 amino acid residues. Two separate PGR 
amplifications were performed as described above with 
the only exception being that, in the second reaction, 
the 5* coding degenerate primer^ designated HCDRDIO, 
used to encode 10 amino acid residues comprising the 
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heavy chain CDR3. The degenerate 5' coding primer 
used here was designed to retain the first amino acid 
position of a glycine residue in the PC3AP313 template 
and incorporate a conserved aspartic acid residue in 
5 the ninth amino acid position. The HCDRDIO primer had 
the formula: 

5 • -GCCGTGTATTACTGTGCGAGAGGTNNKNNIOTOKNNKIWKNNKNNKGACNNK 
TGGGGCCAAGGGACCACGGTC-3 • (SEQ ID NO 52) , where N is A, 
C, G or T and K is G or T. The amino acid sequences 

10 comprising the CDR3 encoded by the use of the HCRDIO 
primer had the formula NH2-GXXXXXXXDX-COOH (SEQ ID NO 
61) , where X is Xaa, any amino acid, G is glycine and 
D is aspartic acid. The resultant products were 
pooled and purified as described above prior to 

15 insertion into pC3AP313 surface display phagemid 

expression vector to form a library as described in 
Example 2. The resulting phagemid library was 

designated library F. 

PGR amplifications using the template pC3AP313 

20 were also performed for expressing a randomized CDR3 
containing 16 amino acid residues. The degenerate 5' 
coding primer used for this amplification was designed 
to retain the first amino acid position of a glycine 
residue in the PC3AP313 template and incorporate a 

25 conserved aspartic acid residue in the fifteenth amino 

acid position. Two separate PGR amplifications were 
performed as described above for the CDR3 having 5 
amino acids with the only exception being that, in the 
second reaction, the 5' coding degenerate primer, 

30 designated HCDRD16, used to encode 16 random amino 
acid residues had the formula: 

5 • -GCCGTGTATTACTGTGCGAGAGGTNNKNNKNNKNNKNNKNNKNNKNNKNNK 
NNKNNKNNKNNKGACNNKTGGGGCCAAGGGACCACGGTC-3' (SEQ ID NO 
53), where N is A, C, G or T and K is G or T. The 
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amino acid sequences comprising the CDR3 encoded by 
the use of the HCRD16 primer had the formula 
NH2-GXXXXXXXXXXXXXDX-COOH (SEQ ID NO 62), where X is 
Xaa, any amino acid, G is glycine and D is aspartic 
5 acid. The resultant products were pooled and purified 

as described above prior to insertion into pC3AP313 
surface display phagemid expression vector to form a 
library as described in Example 2. The resulting 
phagemid library was designated library E. 

10 As described above, the resultant metal binding 

site encoding PGR products, having heavy chain CDR3s 
of various lengths amplified from either pC3AP3l3 or 
P7EIII, were purified, digested and ligated into 
PC3AP313 for preparation of an expression library as 

15 described in Example 2. 

2. Production r.f PhaaeiP ^H Fab-Disnl aved Synthetic 
Binding Sites 

In practicing this invention to obtain expression 
20 of Fab-display proteins containing a synthetic binding 

site on a phage surface, the heavy (Fd consisting of 
Vh and CHI) and light (kappa) chains (V^, Cj.) of 
antibodies were first targeted to the periplasm of E^ 
coii for the assembly of heterodimeric Fab molecules. 
25 In this system, the first cistron encoded a 

periplasmic secretion signal (pelB leader) operatively 
linked to the fusion protein, Fd-cpIII. The second 
cistron encoded a second pelB leader operatively 
linked to a kappa light chain. The presence of the 
30 pelB leader facilitated the coordinated but separate 
secretion of both the fusion protein containing the 
synthetic binding site and light chain from the 
bacterial cytoplasm into the periplasmic space. 

In this process, each chain was delivered to the 
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periplasmic space by the pelB leader sequence, which 
was subsequently cleaved. The heavy chain containing 
the synthetic binding was anchored in the membrane by 
the cpIII membrane anchor domain while the light chain 
5 was secreted into the periplasm. Fab molecules were 
formed from the binding of the heavy chain with the 
soliible light chains. 

A. PreparatTon of a Dicistr onic Expression 

10 Vector. pCQmb3. Capable o f Expressing a 

Phaaem.id Fab p -iaplay Protein 
The pComb3 phagemid expression vector was 
used for expressing the synthetic binding 
site-containing antibodies of this invention. The 

15 antibody Fd chain comprising variable (Vg) and 

constant (CHI) domains of the heavy chain were fused 
with the C-terminal domain of bacteriophage gene III 
(3) coat protein. Gene III of filamentous phage 
encodes a 406-residue minor phage coat protein, cpIII 

20 (cp3), which is expressed prior to extrusion in the 

phage assembly process on a bacterial membrane and 
accumulates on the inner membrane facing into the 
periplasm of E. coli. 

The phagemid vector, designated pComb3, allowed 

25 for both surface display and soluble forms of Fabs. 
The vector was designed for the cloning of 
combinatorial Fab libraries. The Xho I and Spe I 
sites were provided for cloning complete PCR-amplified 
heavy chain (Fd) sequences consisting of the region 

30 beginning with framework 1 and extending through 
framework 4. An Aat II restriction site is also 
present. The presence of the Aat II site allows for 
the insertion of Xho I /Aat II digests of the PGR 
products prepared in Example 1 that contain sequences 
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beginning with framework l and extending to the end of 
the CDR3 domain in which the sequences for encoding 
the synthetic binding sites are located. The 
insertion of an Xho I/Aat II digest into pC3AP313 
5 results in the fusion of the insert with the framework 
4 domain in the pC3AP313 vector. Thus, the insertion 
results in the in-frame ligation of a complete heavy 
chain fragment consisting of PGR amplified framework 1 
through CDR3 and pC3AP3 13 -retained framework 4. The 
10 Sac I and Xba I sites were provided for cloning PGR 
amplified antibody light chains. The cloning sites 
were compatible with previously reported mouse and 
human PCR primers as described by Huse et al., 
science . 246:1275-1281 (1989) and Persson et al., 
15 K.-hl. A^ f^- Sc^i.. USA. 88:2432-2436 (1991). The 

nucleotide sequence of the pelB, a leader sequence for 
directing the expressed protein to the periplasmic 
space, was as reported by Huse et al., supra . 

The vector also contained a ribosome binding site 
20 as described by Shine et al., Nature , 254:34 (1975). 
The sequence of the phagemid vector, pBluescript, 
which includes ColEl and Fl origins and a 
beta-lactamase gene, has been previously described by 
Short et al., K.nr,- Acids Res. . 16:7583-7600 (1988) and 
25 has the GenBank Accession Number 52330 for the 

complete sequence. Additional restriction sites, Sal 
I, Acc I, Hinc II, Cla I, Hind III, Eco RV, Pst I and 
Sma I, located between the Xho I and Spe I sites of 
the empty vector were derived from a 51 base pair 
stuffer fragment of pBluescript as described by Short 
et al., supra . A nucleotide sequence that encodes a 
flexible 5 amino acid residue tether sequence that 
lacks an ordered secondary structure was juxtaposed 
between the Fab and cp3 nucleotide domains so that 
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interaction in the expressed fusion protein was 
minimized. 

Thus, the resultant combinatorial vector, pComb3, 
consisted' of a DNA molecule having two cassettes to 
express one fusion protein, rd/cp3, and one soluble 
protein, the light chain. The vector also contained 
nucleotide residue sequences for the following 
operatively linked elements listed in a 5' to 3' 
direction: a first cassette consisting of LacZ .= 
promoter/operator sequences; a Not I restriction site; 
a ribosome binding site; a pelB leader; a spacer 
region; a cloning region bordered by 5- Xho I and 3' 
Spe I restriction sites; the tether sequence; the 
sequences encoding bacteriophage cp3 followed by a 
15 stop codon; a Nhe I restriction site located between 
the two cassettes; a second lacZ promoter /operator 
sequence followed by an expression control ribosome 
binding site; a pelB leader; a spacer region; a 
cloning region bordered by 5' Sac I and a 3- Xba I 
20 restriction sites followed by expression control stop 
sequences and a second Not I restriction site. 

in the above expression vector, the Fd/cp3 fusion 
and light chain proteins were placed under the control 
of separate lac promoter/ operator sequences and 
25 directed to the periplasmic space by pelB leader 
sequences for functional assembly on the membrane. 
Inclusion of the phage Fl intergenic region in the 
vector allowed for the packaging of single-stranded 
phagemid with the aid of helper phage. The use of 
30 helper phage superinfection allowed for the expression 
of two forms of cp3. Consequently, normal phage 
morphogenesis was perturbed by competition between the 
Fd/cp3 fusion and the native cp3 of the helper phage 
for incorporation into the virion. The resulting 
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packaged phagemid carried native cp3, which is 
necessary for infection, and the encoded Fab fusion 
protein, which is displayed for selection. Fusion 
with the C-terminal domain was necessitated by the 
5 phagemid approach because fusion with the infective 

N-terminal domain would render the host cell resistant 

to infection. 

The pComb3 expression vector described above 
forms the basic construct of pC3AP313 and p7EIII Fab 

10 display phagemid expression vectors used in this 

invention for the production of human Fab antibodies 
containing synthetic metal binding sites. The 
PC3AP313 and P7EIII phagemid expression vectors 
described in Example 1, having the respective ATCC 

15 accession numbers 75408 and 75409 as described 

■ earlier, are pComb3-based vectors that were originally 
used to screen for heavy and light chain sequences 
encoding human Fab antibodies against tetanus toxin. 

20 B. pi-e paratinn of Exr > T->^ssion Vector Libraries 

■For the Expression of t*"? Phaaemid 
vah-nis plav P roteins 
1) Phagemid T.-ibrarv ronsl-TUction 

In order to obtain expressed human Fab 
25 antibodies having both heavy and light chain 

fragments, phagemid libraries were constructed. The 
libraries provided for the expression of recombinant 
human Fab antibodies having heavy and light chains 
where the synthetic binding sites of this invention 
30 are displayed in the heavy chain CDR3. The PCR 

products resulting from each of the amplification 
reactions prepared in Example 1 were separately 
inserted into a phagemid expression vector to prepare 
phagemid libraries. As described below, the resultant 
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gel purified heavy chain PGR fragments prepared in 
Example 1 were digested with the restriction enzymes 
and separately ligated into the pC3AP313 phagemid 
expression vector that was similarly digested. 
5 For preparation of phagemid libraries for 

expressing Fab display proteins that bind to selected 
metals, the PGR products prepared in Examples lA and 
IB were digested with Xho I and Spe I and ligated with 
a similarly digested pC3AP313 phagemid expression 

10 vector prepared as described in Examples 1 and 2A. 
The ligation resulted in operatively linking the 
framework 1 through framework 4 PGR products with the 
CHI region present in the pC3AP313 vector. 

Phagemid libraries for expressing the Fab display 

15 synthetic metal binding sites of this invention from 
each of the final 5 PGR amplification products 
produced above were prepared in the following 
procedure. To form circularized vectors containing 
the PGR product insert, 640 ng of the digested PGR 

20 products were separately admixed with 2 ug of the 

linearized pG3AP313 phagemid vector and ligation was 
allowed to proceed overnight at room temperature using 
10 units of BRL ligase (Gaithersburg, MD) in BRL 
ligase buffer in a reaction volume of 150 ul. Five 

25 separate ligation reactions were performed to increase 
the size of each separate phage library for selecting 
synthetic metal binding sites in a heavy chain CDR3. 

Following the ligation reactions, the 
circularized DNA was precipitated at -20C for two 

30 hours by the admixture of 2 ul of 20 mg/ml glycogen, 
15 ul of 3 M sodium acetate at pH 5.2 and 300 ul of 
ethanol. DNA was then pelleted by microcentrifugation 
at 4G for 15 minutes. The DNA pellet was washed with 
cold 70% ethanol and dried under vacuum. The pellet 
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was resuspended in lO ul of water and transformed by 
electroporation into 300 ul of XLl-Blue cells 

to fona a phage library. The total yield from the PCR 
amplification and transformation procedure described 
5 herein was approximately 1 X 10« transformants from 

each library. 

After transformation, to isolate phage on whxch 

Fabs displaying synthetic binding sites had been 

induced for subsequent panning or selection on metal 

10 resins as described in Example 3, 3 ml of SOC medium 

(SOC was prepared by admixture of 20 grams (g) 

bacto-tryptone, 5 g yeast extract and 0.5 g NaCl in 1 

liter of water, adjusting the pH to 7.5 and admixxng 

20 ml of glucose just before use to induce the 

15 expression of the Fd-cpIII and light chain 

heterodimer) were admixed and the culture was shaken 
at 220 rpm for 1 hour at 37C, after which 10 ml of SB 
were admixed(SB was prepared by admixing 30 g 
tryptone, 20 g yeast extract, and 10 g Mops buffer per 

20 liter with pH adjusted to 7) containing 20 ug/ml 
carbenicillin and 10 ug/ml tetracycline. The 
admixture was then shaken at 300 rpm for an additional 
houiT • 

This resultant admixture was admixed to 100 ml SB 
containing 50 ug/ml carbenicillin and 10 ug/ml 
tetracycline and shaken for 1 hour, after which helper 
phage VCSM13 (10^^ pfu) were admixed and the admixture 
was shaken for an additional 2 hours. After this 
time, 70 ug/ml kanamycin was admixed and maintained at 
SOC overnight. The lower temperature resulted in 
better heterodimer incorporation on the surface of the 
phage. The supernatant was cleared by centrifugation 
(4000 rpm for 15 minutes in a JAIO rotor at 4C) . 
Phage were precipitated by admixture of 4% (w/v) 
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polyethylene glycol 8000 and 3% (w/v) NaCl and 
maintained on ice for 30 minutes, followed by 
centrifugation (9000 rpm for 20 minutes in a JAIO 
rotor at 40) . Phage pellets were resuspended in 2 ml 
5 of PBS and microcentrifuged for three minutes to 

pellet debris, transferred to fresh tubes and stored 
at -*20C for subsequent screening as described below. 

For determining the titering colony forming units 
(cfu) , phage (packaged phagemid) were diluted in SB 
10 and 1 ul was used to infect 50 ul of fresh (AOD600 = 
1) E> coli XLl-Blue cells grown in SB containing 10 
ug/ml tetracycline. Phage and cells were maintained 
at room temperature for 15 minutes and then directly 
plated on LB/carbenicillin plates. 

15 

3. Selection of the Phagemid Fab-Disolaved Synthetic 
Binding Site Proteins 

A. Multiple Pannings of the Phage Library 
Having Phagemid Fab-Displayed Synthetic 

20 Metal Binding Site Proteins 

The five phage libraries produced in Example 
2 having heavy chain fragments with Fab display 
synthetic binding site regions were panned as 
described herein on a microtiter plate coated with 

25 selected target molecules. The target molecules used 

in screening for the phagemid-anchored Fab-displayed 
synthetic metal binding site proteins were magnetite 
and the metal salts of copper, zinc, lead, cerium and 
iron (Aldrich Chemicals, Milwaukee, WI) . 

30 Selection for binding to copper (II) and magnetite 

utilized all five phage libraries in a competitive 
fashion such that in the first round of selection 
libraries were kept separate until the elution step 
when the binding phage that were eluted were 
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subsequently pooled for amplification and subsequent 
selections. Sequence analysis of the resulting 
binding phage revealed that all were derived from 
libraries E and F, respectively encoding a 16 and 10 
5 amino acid residue CDR3 , resulting from the 

mutagenesis with the oligonucleotides containing the 
NNK nucleotide triplet degeneracy. Library 7-16 was 
the original source of semisynthetic fluorescein 
binding Fabs. 

10 In subsequent selections , only libraries E and F 

. were examined. These libraries retained the aspartic 
acid residue (p) in the penultimate position (next to 
last) in the encoded HCDR3 and their success supports 
the structural significance of this residue within 

15 HCDR3 . Although the degenerate oligonucleotides used 
for amplifying the E and F libraries were designed to 
encode a glycine (G) amino acid residue in the first 
position of the heavy chain CDR3, in some selected 
metal binding antibodies, a serine (S) amino acid 

20 residue occupied that position. For a listing of the 
resultant heavy chain CDR3 amino acid sequences, sle "^ 
section Bl on "Metal Binding Site Polypeptides". The 
presence of the serine was the result of nonf idelity 
of the PGR amplification. The antibodies containing a 

25 serine in the first position nevertheless were 

selected by panning as described below against 
particular metals indicating that the serine was not a 
deleterious amino acid substitution. 

Selection for metal binding using sepharose beads 

30 proceeded in an analogous fashion to biopanning 

described by Parmley et al., Gene, 73:305-318 (1988) 
where the target antigen was immobilized on a 
microtiter plate. Chelating sepharose (10 mg dry 
weight) was loaded by suspension in a 10 mM solution 
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of CUSO, ZnCl2, CeClj, reClj, or Pb(N03)2 in water. 
Excess metal was removed by washing the resin twice 
with TBS. Phage libraries prepared in Example 2 in 25 
mM Mops containing 150 mM NaCl and 1% BSA were applied 
to the loaded sepharose and maintained at 37C with 
shaking for 2 hours. For the initial selection, beads 
were washed once with Mops buffered saline, 
subsequently, the reacted beads were washed 5 times 
for rounds 2 and 3 and 10 times for each additional 
round of selection. Bound phage were eluted once with 
elution buffer supplemented with l mM metal and once 
with 50 mM EDTA to remove excess metal from the resin, 
the ten-fold increase in background binding which was 
presumably due to the increased surface area of the 
beads necessitated additional rounds of selection, 
generally 5 to 6 as compared to 3 to 4 for binding to 
protein antigens and small organic haptens. 

Eluted phage were used to infect 2 ml of fresh 
(OD =1) E. coll XLi-Blue cells for 15 minutes at 
20 room temperature, after which 10 ml of SB containing 

20 ug/ml carbenicillin and 10 ug/ml tetracycline was 
admixed. Aliquots of (20, 10, and 1/10 ul were 
removed for plating to determine the number of phage 
(packaged phagemids) that were eluted from the plate. 
25 The culture was shaken for 1 hour at 37C, after which 
it was added to 100 ml of SB containing 50 ug/ml 
carbenicillin and 10 ug/ml tetracycline and shaken for 
1 hour. Then helper phage VCSM13 (10« pfu) were 
added and the culture was shaken for an additional 2 
30 hours. After this time, 70 ug/ml kanamycin was added 
and the culture was incubated at 37C overnight. Phage 
preparation and further panning were repeated as 

described above. 

Following each round of panning, the percentage 
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yield of phage must be determined, where % yield - 
(number of phage eluted/number of phage applied) X 
100. 

The final phage output ratio was determined by 
5 infecting two ml of logarithmic phase XLl-Blue cells 
as described above and plating aliquots on selective 
plates. In the first panning, approximately lo" 
phage were applied to four wells and approximate 
yields ranged from 5.0 to 8.0 X 10^ e luted phage. 

10 After the fourth panning eluted phage ranged from 1.0 
X 10* to 1.0 X 10* phage. From this procedure, clones 
were selected from each of the Fab libraries for their 
ability to bind to their respective selected target 
proteins. The panned phage surface libraries were 

15 then converted into ones expressing soluble 

Fab-displayed synthetic binding site proteins for 
further characterization as described in Example 4. 

B. Metal Chelate Affinitv Ch romatoaT-anhv of the 
20 Phage Library for Phaoe mid Fab-Displavcd 

synthetic Metal Bindin g Site Proteins 
For selecting a magnetite-specific Fab 
displayed synthetic metal binding site protein of this 
invention, the five phage libraries prepared in 
25 Example 2 were applied to affinity columns on which 
selected metals had been immobilized. For this 
selection procedure, metal chelate affinity 
chromatography was performed using Pharmacia HiTrap 
columns according to manufacturer's instructions 
30 (Pharmacia) . HiTrap columns are packed with 1 or 5 ml 

chelating Sepharose High Performance, a newly 
developed matrix. The columns are made of 
polyethylene which is biocompatible and 
non- interactive with biomolecules . The amino acids 
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histidine, cysteine and tryptophan, present in almost 
every protein, allows for the foroation of complexes 
with many transition metal ions. Thus, the chelating 
Sepharose High Performance, charged with selected 
5 metal ions, selectively retains proteins if the 

complex-forming amino acid residues are exposed on the 
protein surface. The Sepharose used in the columns 
consists of highly cross-linked agarose beads coupled 
by stable ether groups to iminodiacetic acid via 
10 7-atom spacer arms. The coupling technique ensures 
both high capacity and performance while minimizing 
leakage of the iminodiacetic groups. 

For selecting magnetite (FegOJ -specific 
Fab-displaying synthetic binding sites, magnetite was 
coupled to the chelating Sepharose High Performance 
following the manufacturer's instructions. The phage 
preparations from each of the phage libraries prepared 
in Example 2 were separately applied to the prepared 
column. To 100 ul of suspended metal complexed resin, 
500 ul of 10" pfu/ml of phage were added. After 
allowing the phage-anchored Fabs to immobilize to the 
metal, the column was washed 5 times with TBS 
containing 0.1% Tween-20. The metal-specific 
immobilized phage-anchored Fabs were then eluted with 
20 ul of acid (P.l M HCl, adjusted to pH 2.2 with 
glycine, containing BSA at 1 mg/ml) . Alternatively, 
the elution was performed with 200 ul of 50 mM EDTA. 
Thereafter, the eluted phage were infected into 
bacteria in order to prepare soluble Fab-displayed 
30 binding site proteins. 

An alternative method for selection of 
magnetite-binding Fab antibodies expressed on the 
surface of phage was performed by applying phage 
libraries prepared as described above to 1 mg of 



20 



25 
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magnetite in a microfuge tube. The resulting 
suspension was shaken at 37C for 2 hours. Magnetite 
was pelleted by centrifugation or magnetic force. 
Magnetite was washed 5 times with TBS-tween in the 
5 first 3 rounds of selection and 10 times in the last 3 
rounds of selection. Bound phage were eluted with the 
standard acidic elution buffer. Subsequent steps were 
performed as described above. 

10 C. Pi-*>paration of Solubl e Fab-Displaved Metal 

Rinding p ifp Proteins 
In order to further characterize the 
specificity "of the Fab-displayed synthetic binding 
site proteins ei^ressed on the surf ace of phage as 

15 described . above/ soluble heterodimers were prepared 
and analyzed in ELISA assays on target-coated plates 
and by competitive ELISA with increasing 
concentrations of soluble competitor as described 
below . 

20 To prepare soluble Fabs consisting of heavy and 

light chains (i.e., heterodimers) , phagemid DNA from 
positive clones selected in Examples 3A and 3 B above 
was isolated and digested with Spe I and Nhe I. 
Digestion with these enzymes produced compatible 

25 cohesive ends. The 4.7 kilobase (kb) DNA fragment 
lacking the glll portion was gel-purified (0.6% 
agarose) and self-ligated. Transformation of p. coli 
XLl-Blue afforded the isolation of recombinants 
lacking the glll fragment. Clones were examined for 

30 removal of the glll fragment by Xho I/Xba I digestion, 
which should yield an 1.6 kb fragment. Clones were 
grown in 100 ml SB containing 50 ug/ml carbenicillin 
and 20 mM MgClj at 37C until an OD^go of 0.2 was 
achieved. IPTG (1 mM) was added and the culture grown 
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overnight at 30C (growth at 37C provides only a light 
reduction in heterodimer yield) . Cells were pelleted 
by centrifugation at 4000 rpn for 15 minutes in a JAlO 
rotor at 4C. Cells were resuspended in 4 ml PBS 
containing 34 ug/ml phenylmethylsulfonyl fluoride 
(PMSF) and lysed by sonication on ice (2-4 minutes at 
50% duty) . Debris was pelleted by centrifugation at 
14,000 rpm in a JA20 rotor at 4C for 15 minutes. The 
supernatant was used directly for ELISA analysis and 
was stored at -20C. For evaluating a large number of 
clones, 10 ml cultures provided a sufficient amount of 
Fab-displayed synthetic binding site proteins for 
analysis, in this case, sonications were performed in 

2 ml of buffer. 

The soluble heterodimers prepared above were 
assayed by ELISA where applicable as described in 
Example 4. 

4. n>..^.r-A'^.r-izat ^'^r ^ »^ snlubl P Fabs Containing Metal 

20 Bipriina Sites 

A. no.h«.T^inat j »n of Specificity of the Metal 

Rinding S ^^-p PToteins 
1) ELISA 

TO assay metal binding specificity of 
25 soluble Fabs prepared in Example 3C, ELISA was 

performed using a bovine serum albumin-iminodiacetic 
acid (BSA-IDA) conjugate. A 10 ml solution of 0.1 M 
carbonate buffer at pH 9 was prepared containing 10 
mg/ml BSA. A 0.9 mM solution of 1,4-Butanediol 
30 diglycidyl ether was added to the BSA-containing 
solution and allowed to react for 4 hours at 37C. 
Iminodiacetic acid, 1.8 mM, neutralized with NaOH and 
dissolved in 3ml of O.IM bicarbonate at pH 9 was then 
added to the protein containing solution and allowed 



wo 94/18220 



- 95 - 



PCTAJS94/01238 



to react overnight at 37C. The conjugate was purified 
by passage over a Pharmacia PD-10 Sephadex column 
(Pharmacia) . 

Once the BSA-IDA conjugate was prepared, it was 
5 diluted in TBS to 1 ug/ml and was applied to each well 
of a 96 well ELISA plate. The plate was maintained 
overnight at 4C or for l hour at 37C to allow for the 
conjugate to bind to the plastic. The coating 
solution was then removed and 50 ul of 1% BSA in TBS 
10 were applied. After blocking for 30 minutes at 37C 
and removal of the BSA solution, 40 ul of 1 mM EDTA 
were applied to all wells to chelate preexisting metal 
ions. After removal of this solution, 40 ul of a 1 mM 
: solution of a metal salt, either MgClj, NiClj, CuClj, 
15 ZnCl, PbCl, CeHzO,, and FeCla, in TBS were separately 
added to allow for the formation of separate 
metal-loaded BSA-IDA complexes. Following metal 
loading, the plate was washed 5 times with purified 
water, dried and soluble Fab antibodies prepared in 
20 Example 3 C were separately added to allow for the 

formation of immunoreaction products. After two hours 
at 37 C, the plate was washed 10 times with PBS-tween 
0.05% to remove unbound soluble Fab. Goat anti-human 
IgG F(ab» )2 °o^P^®^ ^° alkaline phosphatase, 40 ul of 
25 a 1 ug/ml solution in 1% BSA/TBS were then applied to 
each well to provide for labeling of the primary 
immunoreaction product. After 30 minutes at 37C, the 
plate was washed 10 times with PBS-tween 0.05%, and 
the substrate, p-nitrophenyl phosphate, diluted to 
30 0.1% in 10% diethanolamine containing 0.01% MgClj at 

pH 9.8 was added to allow for the detection of the 
primary immunoreaction products through the labeled 
reacted secondary antibody. Optical density was 
measured at 405nm. 
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Three cu(II) selected antibodies, cu-1, Cu-2 and 
Cu-11, were prepared in soluble form and purified for 
use in above ELISA. The results are shown in Figure 1 
where all three Fabs bound Cu(II) loaded BSA-IDA 
5 significantly over background. Interestingly, two of 
these demonstrated a marked preference for nickel 
(Ni(II)) over all the other metals examined. Each of 
the three Fabs had unique specificity characteristics, 
as one might expect from a random selection procedure. 
10 These results highlight the diversity which still 

exists in the metal binding sublibrary. The binding 
specificity of the Pb(II) and Ce(III) selected Fabs 
was also confirmed by ELISA. 

2) Tg«i-iinai:ion ^he Formation Constant 
Formation constants were estimated for 
Fab cu-2 binding to nickel-loaded BSA-IDA based on the 
ELISA results. The BSA-IDA nickel loaded conjugate 
was prepared as described above. The formation 
20 constant was determined by a competition ELISA 

experiment. A stock solution of 2 x lO'S M BSA-IDA was 
saturated with ViCl^ in TBS. The resultant 
metal-loaded BSA-IDA solution was then serially 
diluted to 2 X lo " M. one hundred microliters of 
25 each BSA-IDA-Ni solution were pre-maintained with 100 
ul of 2 ug/ml of Fab cu-2 to give final BSA-IDA-Ni 
concentrations of lO'^^, lO"', lO'^, 10''. lO'S 5 x lO"' 
and 10-5 Forty microliters of the resultant Cu-2 
Fab-Ni-BSA-IDA complexes were then applied in 
30 quadruplicate to ELISA wells that had been previously 
coated with Ni-BSA-IDA. Normal ELISA protocols were 
then followed as described above. The formation 
constant was defined as the reciprocal of the 
concentration of BSA-IDA-metal that reduced 0D„5 to 



half maxinal. Assays were also performed with the 
addition of EDTA to compete the binding of soluble 
Fabs containing metal binding sites to metal-loaded 

substrates. 

The formation constant of clone Cu-2 for 
Ni-BSA-IDA was estimated by competition analysis to be 
10"' M. Previous studies indicate that the formation 
constant for a single histidine with Cu(II) 
IDA-polyethylene-glycol is approximately lO'^ M as 
described by Suh et al., Biotechnol. Bio. Engng., 
33:682-690 (1990). In the instant invention, 
metal-dependent binding was abolished by addition of 
EDTA and competed with excess free metal. 

B. semienee heteinnina tion of the Binding Site 
Proteins 

Nucleic acid sequencing was performed on 
double-stranded DNA using Sequenase 1.0 (USB, 
Cleveland, OH) encoding the specific soluble 
Fab-displayed synthetic metal binding site proteins of 
this invention characterized above. The amino acid 
residue sequences and analysis thereof for each of the 
specific Fab-displayed synthetic binding site proteins 
specific for magnetite, copper, zinc, lead, cerium, 
and iron metal salts were listed earlier in 
specification in Section Bl entitled "Metal Binding 
Site Polypeptides". 

The selected antibodies were either derived from 
library E or F in which the CDR3 was 16 or 10 amino 
acid residues in length, respectively, containing an 
aspartic acid (D) amino acid residue in the 
penultimate position. Analysis of the sequences of 
the selected antibodies confirms the success of the 
selection strategy as shown for example by the 
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Cu (II) -Chelating sequences. Based on the knowledge of 
characterized copper containing protein structtires, 
the expected ligands are His, Met, and Cys. A total 
of 106 positions within the 11 sequenced Cu(II) 
5 selected CDRs were randomized with the NNK-encoded 

degenerate oligonucleotides designed as described in 
Example IB. Histidine was selected at 26 positions, 
Met at 6, and Cys at 3. The NNK degeneracy provided 
each of these residues at only one part in 32 in the 

10 unselected library. One clone contained an unpaired 
cys residue which is a rare feature in antibody CDRs 
as described by Kabat et al. , -Sequences of Proteins 
of Immunological Interest". 5th Ed. , (N.I.H., 
Washington, D.C. , (1991). 

15 By utilizing IDA immobilized with a 12 atom 

spacer, selection should not be limited to surface 
exposed residues. Three clones contained an His at 
position 102 which is predicted not to be on the 
surface of the protein as described by Chothia et al. , 

20 , T. MoT. Biol. . 196:901-917 (1987). The two His rich 
clones, Cu-1 and Cu-2, demonstrated a preference for 
Ni (11) -loaded BSA-IDA while only the Trp rich clone, 
Cu-11, showed selectivity for Cu(II) as shown in 
Figure l. 

25 sequences were analyzed for antibodies that were 

selected for binding to Zn(Il), Pb(II) , Ce(III) , and 
Fe(III). Zn (II) -selected sequences show the His rich 
character seen for Cu{II) and one sequence which is 
identical to that selected with Cu(II) . Only one 
clone was rich in carboxy lates . Selection for His, 
cys. Asp, and Glu is expected based on studies of 
natural proteins as described by Vallee et al. , 
Biochem. . 29:5647-5659 (1990). The coordinating 
ligands from sequences selected for binding to Pb(II) , 



30 
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Ce(III), and Fe(III) were less obvious. A number of 
features can however be noted. Pb(II) sequences were 
rich in Asp and Glu (10% of randomized residues) . For 
comparison, Ce(III), Fe(III), and Cu(II) selected 
5 sequences contained these residues at 7%, 4%, and 5%, 
respectively. The oxygen-containing ligands Ser, Thr, 
and Tyr constituted 18% of Pb(Il) and Ce(III) 
sequences as compared to 11 and 9% for Fe (III) and 
Cu(II). The amino acid residues, Ser and Thr, were 
10 over-represented at the synthesis level and were 

present at 3 and 2 parts per 32. Fe(III) sequences 
were enriched to the 10% level for the sulfur 
containing residues Cys and Met. Coordination with 
main chain carbonyl oxygens or amide nitrogens cannot 
15 be discounted and would not be discernible from 
sequence analysis . 

Thus, within the pool of 20 commonly occurring 
amino acids almost half were observed to participate 
in metal ligation within proteins (Asp, cys, Glii, His, 
20 Met, Ser, Thr, Tyr , and Trp) . Main chain carbonyl 
oxygens and amide nitrogens were also be utilized to 
coordinate metals. Most protein engineering efforts 
utilize design strategies based on analogy as 
described by Tainer et al., Curr. Op in. Biotech. , 
25 2:582-591 (1991). While such a strategy may be 

sufficient for the transfer of known metal binding 
motifs into alternative proteins, it limits ones 
ability to explore and exploit novel reactivities for 
most of the periodic table. The ability to 
30 selectively sort proteins in vitro for metal ligation 
has the distinct advantage that it is not limited in 
scope by natural phylogenies. Furthermore, the 
approaches of screening for metal binding sites in 
this invention provides for the ability to generate a 
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variety of ligating groups that provide a multiplicity 
of coordination numbers and geometries, both of vhich 
alter the reactivity of the bound metal. 

The efficiency with which the first stage of an 
5 iterative approach to achieving catalytic 

metalloantibodies is demonstrated in this invention. 
A benefit of such an evolutionary approach, as opposed 
to a designed template approach, is that sublibraries 
can be created whose members display a diversity of 

10 coordinating residues in a range of potentially useful 
contexts. The advantage of creating such sublibraries 
stems from the power of developing selective 
approaches that allow rare but interesting members to 
be further selected based on criteria that can be 

15 entirely different than those used to create the 
original library. 

5. Deposit of Materials 

The following cell lines and plasmids have been 
20 deposited on or before February 2, 1993, with the 

American Type Culture Collection, 1301 Parklawn Drive, 
Rockville, MD, USA (ATCC) : 

Materia] ATCC Accession 

25 No^ 

Plasmid PC3AP313 ATCC 75408 

Plasmid P7EIII ATCC 75409 

These deposits were made under the provisions of 
30 the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of 
Patent Procedure and the Regulations thereunder 
(Budapest Treaty) . This assures maintenance of a 
viable culture for 30 years from the date of deposit. 
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The deposits will be made available by ATCC under the 
terms of the Budapest Treaty which assures permanent 
and unrestricted availability of the progeny of the 
culture to the public upon issuance of the pertinent 
5 U.S. patent or upon laying open to the public of any 
U.S. or foreign patent application, whichever comes 
first / and assures availability of the progeny to one 
determined by the U.S. Commissioner of Patents and 
Trademarks to be entitled thereto according to 35 
10 U.S.C. §122 and the Commissioner's rules pursuant 
thereto (including 37 CFR §1.14 with particular 
reference to 886 OG 638) . The assignee of the present 
application has agreed that if the culture deposit 
should die or be lost or destroyed when cultivated 
15 under suitable conditions, it will be promptly 

replaced on notification with a viable specimen of the 
same culture. Availability of the deposited strain is 
not to be construed as a license to practice the 
invention in contravention of the rights granted under 
20 the authority of any government in accordance with its 
patent laws. 

The foregoing written specification is considered 
to be sufficient to enable one skilled in the art to 

25 practice the invention. The present invention is not 
to be limited in scope by the cell lines and plasmids 
deposited, since the deposited embodiment is intended 
as a single illustration of one aspect of the 
invention and any cell lines or plasmid vectors that 

30 are functionally equivalent are within the scope of 
this invention. The deposit of material does not 
constitute an admission that the written description 
herein contained is inadecjuate to enable the practice 
of any aspect of the invention, including the best 
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mode thereof, nor is it to be construed as limiting 
the scope of the claims to the specific illustration 
that it represents. Indeed, various modifications of 
the invention in addition to those shown and described 
herein will become apparent to those skilled in the 
art from the foregoing description and fall within the 
scope of the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: ^ , . 

(A) NAME: The Scripps Research Institute 

(B) STREET: 10666 North Torrey Pines Road 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: United States of America 

(F) POSTAL CODE (ZIP) : 92037 

(G) TELEPHONE: 619-554-2937 

(H) TELEFAX: 619-554-6312 

(ii) TITLE OF INVENTION: METHODS FOR PRODUCING POLYPEPTIDE METAL 
BINDING SITES AND COMPOSITIONS THEREOF 

(iii) NUMBER OF SEQUENCES: 65 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0. Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/US 94/ 

(B) FILING DATE: Ol-FEB-1994 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/077,797 

(B) FILING DATE: 14-JUN-1993 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/012,566 

(B) FILING DATE: 02-FEB-1993 

(2) INFORMATION FOR SEQ ID N0:1: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
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(XL) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Ser Arg Arg Ser Arg His His Pro Arg Met Trp Asn Gly Leu Asp Val 
1 5 10 

t 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
Gly Arg Fhe Lys Arg Val Arg Asp Arg Trp Val Val He Phe Asp Phe 



1 5 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Gly Val Ala Arg Ser Lys Lys Met Arg Gly Leu Trp Arg Leu Asp Val 
1 5 10 15 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(v) FRAGMENT TYPE: internal . 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:^: 

GlY Leu Ala Val Arg Ser Lys Arg Gly Arg Phe Phe Leu Phe Asp Val 

. 1 5 .10 15 : 

(2) INFORMATlbN FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

GlY Arc Val His His His Ser Leu Asp Val 
1 • 5 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Ser Trp Lys His His Ala His Trp Asp Val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Ser Trp Asp His Arg Gly Cys Asp Gly 
1 5 - 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly His His Met Tyr Gly Gly Trp Asp His 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Gly His Trp Gly Arg His Ser Leu Asp Thr 
1 5 .10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly His lie Leu His His Gin Leu Asp Leu 

1 5. .10 

(2) INFORMATION FOR SEQ ID NO ill: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ser Ser Gin Arg Leu Met Leu Gly Asp Asn 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Ser His His Gly His His Tyr Leu Asn His 
15 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: ■ 
Gly Lys Leu Met Met Ser Trp Cys Arg Asp Thr Glu Gly Cys Asp 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Gly Asp Thr His Arg Gly His Leu Arg His His Leu Pro His Asp Trp 
1 5 10 J-^ 



(2) INFORMATION FOR SEQ ID N0:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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(2) INFX)RMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOlJOGY: linear 

(ii) MOLECULE TYPE: protein 

(y) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
Ser His Thr His Ala Leu Pro Leu Asp Phe 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE:, internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Gly Arg Val His His His Ser Leu Asp Val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



PCT/US94/01238 

WO 94/18220 

- 110 - 

Gly Gin Ser Ser Gly Gly Asp Thr Asp Asp 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 
Gly Gin Trp Thr Pro Arg Gly Asp Asp Phe 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Gly Arg Cys Cys Pro Ser Ser Cys Asp Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID N0:2i: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 




(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID.NO:22: 

Gly Asn Leu Arg Arg Lys Thr Ser Asp He 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Gly Glu Ser Asp Ser Lys Arg Glu Asp Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gly Gly Pro Ser Leu Ala Val Gly Asp Trp 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Glv Pro Leu Gin His Thr Tyr Pro Asp Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GlY Trp Lys Val Thr Ala Glu Asp Ser Thr Glu Gly Leu Phe Asp Leu 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(V) FRAGMENT TYPE: internal 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Gly Thr Arg Val Trp Arg Val Cys Gin Trp Asn His Glu Glu Asp Gly 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
. (B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

t 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Gly Glu Trp Trp Cys Ser Phe Ala Met Cys Pro Ala Arg Trp Asp Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Gly Asp Thr He Phe Gly Val Thr Met Gly Tyr Tyr Ala Met Asp Va 
1 5 10 ^5 

(2) INFORMATION FOR SEQ ID NO: 30: 

- (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Glv Gin Val Met Gin Glu Leu Gly Asp Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Glv Leu Thr Glu Gin Gin Leu Gin Asp Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Gly Tyr Ser Tyr Ser Val Ser Pro Asp Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 33: 



wo 94/18220 



PCT/US94/01238 



- 115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Gly Arg Leu Gly Leu Val Met Thr Asp Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids , 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ser Thr Trp Pro Gly Arg Gin Arg Leu Gly Gin Ala Leu Ser Asp Ser 
- 3 10 1-^ 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

GlY Tyr Glu Leu Ser Trp Gly Val Asp Gin Gin Glu Trp Trp Asp He 
- ^ 5 10 15 



1 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Gly Pro Val Arg Gly Leu Asp Gin Ser Lys Gly Val Arg Tyr Asp 
15 10 

(2) INFORMATION FOR SEQ ID N0:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Leu Ser Gin His He Val Ser Glu Thr Gin Ser Ser Gly Asp Leu 
5 10 15 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) 

Gly 
1 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Glv Leu Glu Ser Leu Lys Val Leu Gly Val Gin Leu Gly Gly Asp Leu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Gly Asn Met He Leu Gly Gly Pro Gly Cys Trp Ser Ser Ala Asp He 
^ ' 10 15 



1 3 

(2) INFORMATION FOR SEQ ID N0:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 

Gly Cys Trp Asn Val Gin Arg Leu Val Val Tyr His Pro Pro Asp Gly 
1 ^ ^ ^ 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Glv Phe Glu Val Thr Cys Ser Trp Phe Gly His Trp Gly Arg Asp Ser 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Ser Ala Ser Met Arg Ser Ala He Gly Leu Trp Arg Thr Met Asp Tyr 
15 10 15 



(2) INFORMATION FOR SEQ ID NO:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Glv Asp Arg Glu He Phe His Met Gin Trp Pro Leu Arg Val Asp Val 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 44: . . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Ser Gin Asn Pro Gin Gin Val Cys Gly Val Arg Cys Gly Gin Asp Lys 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
..(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Glv Asn Are Leu Ser Ser Gly His Leu Leu Lys Gin Gly Gin Asp Gly 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID HO:46: 

Gly Gly Ser Asp Trp Gin He Gly Ala Cys Cys Arg Glu Asp Asp Leu 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(V) FRAGMENT TYPE: internal 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Gly Met Val Ser Met Met Gly Gin Ser Arg Pro Thr Gin Cys Asp Cys 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Gly Val He Lys Trp He Arg Arg Trp Val Arg Thr Ala Arg Asp Val 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: 
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Gly Trp Phe Trp Arg Leu Leu Pro Thr Pro Arg Ala Pro Ser Asp Val 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GTGTATTATT GTGCGAGANN SNNSNNSNNS NNSTGGGGCC AAGGGACCAC G 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
GTGTATTATT GTGCGAGANN SNNSNNSNNS NNSNNSNNSN NSNNSNNSTG GGGCCAAGGG 60 

66 

ACCACG 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
GCCGTGTATT ACTGTGCGAG AGGTNNKNNK NNKNNKNNKN NKNNKGACNN KTGGGGCCAA 60 
GGGACCACGG TO 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GCCGTGTATT ACTGTGCGAG AGGTNNKNNK NNKNNKNNKN NKNNKNNKNN KNNKNNKNNK 60 
NNKGACNNKT GGGGCCAAGG GACCACGGTC 90 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
CGGGTGTATT ACTGTGCGAG ANNKNNKNNK GACNNKTGGG GGCAAGGGAC CACGGTC 
(2) INFORMATION FOR SEQ ID NO:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
. (iii) HYPOTHETICAL: NO 
(iv) ANTl- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
GCAATTAACC CTCACTAAAG GG 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 
TCTCGCACAA TAATACACGG C 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 
GCATGTACTA GTTTTGTCAC AAGATTTGGG 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(G) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
TCTCGCACAG TAATACACGG CCGT 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FEW^GMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Xaa Xaa Xaa Asp Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 60: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TTGATATTCA CAAACGAATG G 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa 
15 10 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Xaa 
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10 



15 



(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: repeat^region 

(B) LOCATION: 19. .21 

(D) OTHER INFORMATION: /rpt_type- "direct" 
/labels NNS 

/note* "The NNS triplet degeneracy is repeated 
depending on the desired length of the CDR." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
GTGTATTATT GTGCGAGANN STGGGGCCAA GGGACCACG 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: repeat^region 

(B) LOCATION: 25. .27 

(D) OTHER INFORMATION: /rpt^type- "direct" 
/labels NNS 

/note= "The NNK triplet degeneracy is repeated 
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depending on the desired length of the CDR." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GCCGTGTATT ACTGTGCGAG AGGTNNKGAC NNKTGGGGCC AAGGGACCAC GGTC 
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: repeat_region 

(B) LOCATION: 22. .24 

(D) OTHER INFORMATION: /rpt_type- "direct" 
/label- NNS 

/note= "The NNK triplet degeneracy is repeated 
depending on the desired length of the CDR." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CGGGTGTATT ACTGTGCGAG ANNKGACNNK TGGGGCCAAG GGACCACGGT C 



54 



51 



PCT/US94/01238 

WO 94/18220 

- 128 - 



What Is Claimed Is: 

1. A method for producing a metal binding site 
in a polypeptide capable of binding a preselected 
5 metal ion-containing molecule comprising inducing 

mutagenesis of a complementarity determining region 
(CDR) of an immunoglobulin heavy or light chain gene 
by amplifying the CDR of said gene by a primer 
extension reaction using a primer oligonucleotide, 
10 said oligonucleotide having 3' and 5' termini and 

comprising: 

a) a nucleotide sequence at said 3 • 

terminus capable of hybridizing to a first framework 

region of an immunoglobulin gene; 
25 b) a nucleotide sequence at said 5 ' 

terminus capable of hybridizing to a second framework 

region of an immunoglobulin gene; and 

c) a nucleotide sequence between said 3* 

and 5' termini according to the formula: 
20 [NNS]^, 

wherein N is independently any nucleotide, S is G or C 

or analogs thereof, and a is from 3 to about 50, and 

said 3' and 5' terminal nucleotide sequences having a 

length of about 6 to 50 nucleotides, or an 
25 oligonucleotide having a sequence complementary 

thereto . 

2. The method of claim 1 wherein said 
immunoglobulin is human. 

3. The method of claim 1 wherein said CDR is 

30 CDR3. 

4. The method of claim 1 that further comprises 

the steps of: 

a) isolating the amplified CDR to form 
mutagenized immunoglobulin genes; 
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b) expressing the isolated mutagenized 

imiBunoglobulin genes; and 

c) selecting species of the expressed 
mutagenized immunoglobulin genes for the ability to 

5 bind a . preselected metal ion-containing molecule. 

5. The method of claim 1 wherein said 
oligonucleotide has the formula: 

5 ' -GTGTATTATTGTGCGAGA [ NNS ] ^TGGGGCCAAGGGACCACG-3 • ( SEQ 
ID NO 63), or an oligonucleotide having a sequence 

10 thereto. 

6. The method of claim 5 wherein a is 5 (SEQ ID 

NO 50) . 

7. The method of claim 5 wherein a is 10 (SEQ 
ID NO 51) . 

15 8. The method of claim 1 wherein said 

preselected metal ion-containing molecule is selected 
from the group consisting of magnetite, copper (II), 
zinc(II), lead(II), cerium (III) , and iron(III). 

9. A method for producing a metal binding site 

20 in a polypeptide capable of binding a preselected 

metal ion-containing molecule comprising inducing 
mutagenesis of a complementarity determining region 
(CDR) of an immunoglobulin heavy or light chain gene 
by amplifying the CDR of said gene by a primer 

25 extension reaction using a primer oligonucleotide, 

said oligonucleotide having 3' and 5» termini and 

comprising: 

a) a nucleotide sequence at said 3' 

terminus capable of hybridizing to a first framework 
30 region of an immunoglobulin gene; 

b) a nucleotide sequence at said 5' 
terminus capable of hybridizing to a second framework 
region of an immunoglobulin gene; and 

c) a nucleotide sequence between said 3 • 
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and 5' termini according to the formula: 

-X-[NNK].-X-[NNK]-X-, 
wherein N is independently any nucleotide. K is G or T 
or analogs thereof, X is a trinucleotide encoding a 
native amino acid residue coded by said immunoglobulin 
gene and a is from 3 to about 50, and said 3« and 5' 
terminal nucleotide sequences having a length of about 
6 to 50 nucleotides, or an oligonucleotide having a 
sequence complementary thereto. 

10. The method of claim 9 wherein said 
immunoglobulin is human. 

11. The method of claim 9 wherein said CDR is 

CDR3. 

12. The method of claim 9 that further comprises 

15 the steps of: 

a) isolating the amplified CDR to form 

mutagenized immunoglobulin genes; 

b) expressing the isolated mutagenized 

immunoglobulin genes; and 
20 c) selecting species of the expressed 

mutagenized immunoglobulin genes for the ability to 
bind a preselected metal ion-containing molecule. 

13. The method of claim 9 wherein said 
oligonucleotide has the formula: 

25 5 • -GCCGTGTATTACTGTGCGAGAGGT [ NNK] .GACNNKTGGGGCCAAGGGACC 

ACGGTC-3' (SEQ ID NO 64). 

14. The method of claim 13 wherein a is 7 (SEQ 

ID NO 52) . 

15. The method of claim 13 wherein a is 13 (SEQ 

30 ID NO 53) . 

16. The method of claim 9 wherein said 
oligonucleotide has the formula: 

5 • -CGGGTGTATTACTGTGCGAGA[NNK] .GACNNKTGGGGCCAAGGGACCACG 
GTC-3' (SEQ ID NO 65). 
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17. The method of claim 16 wherein a is 3 (SEQ 
ID NO 54) . 

18. The method of claim 9 wherein said 
preselected metal ion-containing molecule is selected 
from the group consisting of magnetite, copper (II), 
zinc(II), lead(II), cerium(III), and iron { III ) . 

19. A human monoclonal antibody produced 
according to the methods of claim 1* 

20. A human monoclonal antibody produced 
according to the methods of claim 9. 

21. A human monoclonal antibody capable of 
immunoreacting with magnetite, wherein said monoclonal 
antibody has the binding specificity of a monoclonal 
antibody comprising a heavy chain immunoglobulin 
variable region amino acid residue sequence selected 
from the group consisting of SEQ ID NOs 1-4 and 
conservative substitutions thereof. 

22. A human monoclonal antibody capable of 
immunoreacting with copper (II) , wherein said 
monoclonal antibody has the binding specificity of a 
monoclonal antibody comprising a heavy chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 5-15 and conservative substitutions thereof. 

23. A human monoclonal antibody capable of 
immunoreacting with zinc (II), wherein said monoclonal 
antibody has the binding specificity of a monoclonal 
antibody comprising a heavy chain immunoglobulin 
variable region amino acid residue sequence selected 
from the group consisting of SEQ ID NOs 16-21 and 
conservative substitutions thereof. 

24. A human monoclonal antibody capable of 
immunoreacting with lead(II) , wherein said monoclonal 
antibody has the binding specificity of a monoclonal 
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antibody comprising a heavy chain immunoglobulin 
variable region amino acid residue sequence selected 
from the group consisting of SEQ ID NOs 22-29 and 
conservative substitutions thereof. 

25. A human monoclonal antibody capable of 
immunoreacting with cerium (III) , wherein said 
monoclonal antibody has the binding specificity of a 
monoclonal antibody comprising a heavy chain 
immunoglobulin variable region amino acid residue 
sequence selected from the group consisting of SEQ ID 
NOs 30-41 and conservative substitutions thereof. 

26. A human monoclonal antibody capable of 
immunoreacting with iron (III), wherein said monoclonal 
antibody has the binding specificity of a monoclonal 
antibody comprising a heavy chain immunoglobulin 
variable region amino acid residue sequence selected 
from the group consisting of SEQ ID NOs 42-49 and 
conservative substitutions thereof. 

27. A DNA expression vector capable of 
expressing a phagemid immunoglobulin display protein 
comprising a polynucleotide sequence that codes an 
immunoglobulin heavy chain polypeptide that includes a 
metal binding site selected from the group consisting 
of SEQ ID NOs 1-49. 

28. The DNA expression vector of claim 27 
wherein said vector is the plasmid pC3AP313. 

29. A host cell comprising the DNA expression 
vector of claim 28. 

30. The DNA expression vector of claim 27 
wherein said vector is the plasmid p7EIII. 

31. A host cell comprising the DNA expression 

vector of claim 30. 

32. A pharmaceutical composition comprising at 
least one dose of an immunotherapeutically effective 
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amount of the monoclonal antibody of claims 21, 22, 
23, 24, 25, and 26 in a pharmacological carrier. 

33. An oligonucleotide useful as a primer for 
producing a metal binding site in a polypeptide coded 

5 for by an immunoglobulin heavy or light chain gene, 

said oligonucleotide having 3* and 5' termini and 

comprising: 

a) a nucleotide sequence at said 3' 
terminus capable of hybridizing to a first framework 

10 region of an immunoglobulin gene; 

b) a nucleotide sequence at said 5* 
terminus capable of hybridizing to a second framework 
region of an immunoglobulin gene; and 

c) a nucleotide sequence between said 3* 
15 and 5' termini according to the formula: 

[NNS]^, 

wherein N is independently any nucleotide, S is G or C 
or analogs thereof, and a is from 3 to about 50, and 
said 3' and 5* terminal nucleotide sequences having a 
20 length of about 6 to 50 nucleotides, or an 

oligonucleotide having a sequence complementary 
thereto . 

34. The oligonucleotide of claim 33 wherein said 
immunoglobulin is human. 

25 35. The oligonucleotide of claim 33 wherein said 

CDR is CDR3. 

36. The oligonucleotide of claim 33 wherein said 
oligonucleotide has the formula: 

5 • -GTGTATTATTGTGCGAGA [ NNS ] ^TGGGGCCAAGGGACCACG-3 • ( SEQ 
30 ID NO 63), or an oligonucleotide having a 

complementary sequence thereto. 

37. The oligonucleotide of claim 36 wherein a is 

5 (SEQ ID NO 50) . 

38. The oligonucleotide of claim 36 wherein a is 
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10 (SEQ ID NO 51) . 

39. An oligonucleotide useful as a primer for 
producing a metal binding site in a polypeptide coded 
for by an immunoglobulin heavy or light chain gene, 

5 said oligonucleotide having 3* and 5' termini and 

comprising: 

a) a nucleotide sequence at said 3' 

terminus capable of hybridizing to a first framework 

region of an immunoglobulin gene; 
IQ b) a nucleotide sequence at said 5' 

terminus capable of hybridizing to a second framework 

region of an immunoglobulin gene; and 

c) a nucleotide sequence between said 3* 

and 5' termini according to the formula: 
15 -X- [ NNK ] a-X- [ NNK ] -X- , 

wherein N is independently any nucleotide, K is G or T 

or analogs thereof, X is a trinucleotide encoding a 

native amino acid residue coded by said immunoglobulin 

gene and a is from 3 to about 50, and said 3* and 5' 
20 terminal nucleotide sequences having a length of about 

6 to 50 nucleotides, or an oligonucleotide having a 
sequence complementary thereto. 

40. The oligonucleotide of claim 39 wherein said 
immunoglobulin is human. 

25 41. The oligonucleotide of claim 39 wherein said 

CDR is CDR3. 

42. The oligonucleotide of claim 39 wherein said 
oligonucleotide has the formula: 

5 • -GCCGTGTATTACTGTGCGAGAGGT [ NNK] ^GACNNKTGGGGCCAAGGGACC 
30 ACGGTC-3' (SEQ ID NO 64). 

43. The oligonucleotide of claim 42 wherein a is 

7 (SEQ ID NO 52) . 

44. The oligonucleotide of claim 42 wherein a is 

13 (SEQ ID NO 53) . 
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45. The oligonucleotide of claim 39 wherein said 
oligonucleotide has the formula: 

5 • -CGGGTGTATTACTGTGCGAGA[NNK] .GACNNKTGGGGCCAAGGGACCACG 

GTC-3' (SEQ ID NO 65). 

46. The oligonucleotide of claim 45 wherein a is 

3 (SEQ ID NO 54) . 
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